ON THE LOCATION OF THE ROOTS OF CERTAIN TYPES 
OF POLYNOMIALS * 


BY 


J. L. WALSH 


When we study the dependence of a variable on & other variables which 
vary independently, our problem may be very much simplified if we can con- 
sider all or some of these independent variables to coincide and thus study the 
dependence of our original dependent variable on one new variable or at least 


on a number of new independent variables less than &. The present writer 
has recently published a theorem (Theorem II, below) which enables us to 
make a reduction of this sort in the study of the relations between the roots of 
certain types of polynomials. The present paper aims to prove Theorem I 


(below), which is a much more general result of the same nature, and to indicate 
various applications of that theorem. The applications given are extremely 
simple and follow from Theorem I with practically no further machinery.t 
The most interesting application is Theorem VI. 

Our problem is, more explicitly, to study the geometric relationship of the 
roots of a polynomial 


f(z) = (2 — a) (z — ag) «++ (2 — am) 
to the roots of a related polynomial 


= (z — by) (z — be) (z — Dn) 

* Presented to the Society, December 29, 1920, and April 23, 1921. 

t+ When this paper was first offered for publication, the writer believed Theorem I to be 
new. Professor D. R. Curtiss has kindly pointed out its connection with a theorem due to 
Grace and has indicated an entirely new proof of Theorem I; the reader will refer to Professor 
Curtiss’s note which immediately follows this paper. 

The point of view of the present paper in the proof and application of Theorem I seems to 
be new, and also the results obtained except where otherwise stated. 

This paper is the development of a short note published in the Paris Comptes 
Rendus, March, 1921, to which explicit reference is made later, and which contained in 
outline the proof of Theorem I. In the interval between the publication of that note and the 
publication of the present paper, there have appeared a number of other papers dealing with 
Grace’s Theorem. See Szegié, Mathematische Zeitschrift, vol. 13 (1922), pp. 28-55; 
Cohn, Mathematische Zeitschrift, vol. 14 (1922), pp. 110-148; Egervdéry, Acta 
Litterarum ac Scientiarum, Regiae Universitatis Hungaricae Fran- 
cisco-Josephinae, vol. 1 (1922), pp. 39-45; Fekete, same Journal, vol. 1 (1923), pp. 
98-100. 

The present paper has very little in common with these other papers, but with Szegé’s 
Theorem 9 the reader should compare our Theorem X, and with Szegé’s Theorems 13-15 
compare our Theorem IV and also Walsh, American Mathematical Monthly, vol. 
29 (1922), pp. 112-114. 
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whose roots are supposed to be known.* When we study the functional 
dependence of a root of f, say a for definiteness, on the b;, it may occur that 


without changing a; we can replace the n roots b; by n roots of a polynomial 
¢ which coincide at some point 6 which bears a simple relation to the roots 5;. 
Then we can study the dependence of a; on b instead of on b;, be, «++, dn, 
which is frequently a simplification of our problem. 

To the plane of the complex variable we adjoin the point at infinity; infinity 
is to be considered simply as an ordinary value of the variable. As a geomet- 
rical consequence we shall consider the term circle to include the possibility of 
a straight line. We shall have occasion to deal with circular regions, by which 
we mean a closed portion of the plane bounded by a single circle, that is, the 
interior of a circle, the exterior of a circle (including the point of infinity), a 
half-plane, a point, or the entire plane; the points of the boundary are always 


to be included in the region. 


I. A GENERAL THEOREM 
We proceed to the proof of our main result: 
TueoreM I. Let f(z) be a polynomial in z whose coefficients are polynomials 
linear in and symmetric in each of the sets of variables 


f 
{Q1, Qe, ak}, {B1, Bo, Bi}, Ao, Ag} 


Let these points {a;}, {Bi}, {Ac} le in circular regions C,, Cg, Cy. 
Then for any fixed values of these variables and of z we can always make all the 
{a;} coincide in C,, all the {B;} coincide in Cg, etc., without altering the value 
of f(z). 

The theorem also obtains if we replace f(z) by the quotient of two poly- 
nomials of the type described, except that we are to consider the conclusion of 
the theorem satisfied if these two polynomials vanish simultaneously.{ Of 
course if we have two polynomials of the type required, their sum is also of 
that type. 

We shall prove the theorem considering the {8;}, {yi}, --:, {Ai} always as 
fixed and showing that we can make the {a;} coincide in C, as stated. Then 
we can consider the {a;} fixed and coincident, and the {y;}, ---, {A} fixed; 
our former reasoning will show that the {8;} can be made to coincide. Con- 
tinued reasoning in this manner will evidently complete the proof of the 
theorem. It will be convenient to assume that the value of f(z) considered 

* Professor Curtiss has recently published a very interesting report on this general field, 
Science, vol. 55 (1922), pp. 189-194. 

t The coefficients of f(z) need not be homogeneous in each of these sets of variables, but 
each coefficient must be a linear combination of the elementary symmetric functions of each 
of these sets with coefficients linear combinations of the elementary symmetric functions of 


the other'sets. These linear combinations may, moreover, contain constant terms. 
t This is what actually occurs in the situation of Theorem II if we choose P inside C. 
+ 
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in the hypothesis is zero; this involves no loss of generality, for the addition 
of a constant term to f(z) does not alter the properties required. 

We now wish to show that the a; can all be made to coincide. If the region 
C,, is a point, the statement is trivial. If we prove the theorem where C, is a 
circular region not the whole plane we have proved it where C, is the whole 
plane. 

Consider the a; to vary independently and to have the region C, as their 
common locus. Then the relation 
(1) f(z) =0 
defines z as an analytic function of the a; and hence z will have a certain locus 
Z; this locus Z will be a closed point set since C, is closed. If (1) degenerates 
and does not effectively contain z at all, we introduce an auxiliary variable ¢ 
by placing 

F(¢) =f(z)-—¢=0; 
the new function F (¢) surely contains ¢ and we may reason with it as before 
for f(z). 

Let a be any point interior to C,. There is a certain locus Z of points z 
corresponding to the null-circle a as the locus of the {a;}. Make use of the 
auxiliary circle C of the coaxial family determined by a and C,. Let C com- 
mence with the position a and gradually enlarge and coincide with C,, the 
region bounded by C and containing a always considered as the locus of the 
{a:;}. The locus Z also grows larger and varies continuously with C (except 


as noted below), for the roots z of (1) are continuous functions of the {a;}. 


When the region C continues to enlarge, beyond C, if necessary, the locus Z 
eventually becomes the entire plane; this happens ordinarily before C has 
swept through the entire plane. The point z with which we start is a point 
of the locus Z surely when C coincides with C, and possibly before. 

The case that the locus Z does not vary continuously with C occurs only if, 
for some choice of the {a;}, (1) vanishes identically in z. When this occurs, 
the roots {z} of (1) do not vary continuously with the {a;}. The locus Z may 
enlarge suddenly and become the entire plane while its boundary does not 
necessarily sweep over the whole plane. 

Whether or not this phenomenon takes place, corresponding to our original 
point z there is some circle C such that for no circle C’, the region C’ smaller 
than and entirely contained in the region C, can z be a point of the locus Z 
of the roots of (1). This statement and in fact Theorem I as well are true if 
z is a point of the locus corresponding to a. The statement is true in any 
other case. For if there is a sequence of sets of points {a;} in a sequence of 
regions each contained in the preceding, there is at least one limiting set of 
points {a;} contained in all the regions and hence in the limit region. The 
relation (1) obtains for this limit set. 
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Our entire proof of Theorem I rests on the remark that when we fix z and 
all but two of the points {a;}, equation (1) becomes a homographic and 
involutory relation between the other two of these points and hence when one 
of these points traces a circle the other also traces a circle. The proof is com- 
plicated, however, by the possibility that this relation may degenerate and 
may effectively contain but one or neither of these variables.* 

We prove now a very special case of Theorem I, namely, that if a circular 
region C contains two points a; and a2 connected by a relation of the form 


(2) G2 + (a, + a2) + a3 = O, 
then C contains a root a’ of the equation 
(3) a; a” + 2a. a’ + az = O. 


Transform (2) by an auxiliary linear transformation so that the point at 
infinity is a double point of the transformation (a1, a) defined by (2). The 
line through a; and az is transformed into itself by the transformation (a, a), 
for the three points (a, a2, © ) go into (a2, a,, ©). When a is moved on 
this line, a, moves on the line in the opposite sense, for if the two points move 
always in the same sense, when a; is moved along the finite segment from its 
original position to the original position of a2, a2 has moved in the same sense, 
has at no time coincided with the point at infinity and hence has not reached 
the original position of a,. The points a; and a2 moving on the line joining 
them in opposite senses must have a double point D between them. It is 
thus evident that the region C which contains a; and a2 must contain D or the 
point at infinity and hence a solution of (3). 

The reasoning just used supposes implicitly that the relation (2) does not 
degenerate. If (2) does not contain a for a particular value of a;, that value 
of a is a solution of (3). If (2) is satisfied identically, any values a; and ae 
suffice. 

We now return to the proof of Theorem I in its general form. Let C be the 


* Thus we may start with the auxiliary polynomial 


F(z) = (2 — a) (2 — a2) +++ (2 — a) k>3 


and consider for the polynomial of Theorem I 

f(z) = F’(z). 
The relation 
(1’) f(z) =0 


if = a, = z contains effectively none of the variables az, azo. If a, 
we have ( 1’) reducing to 
(z — a) (z — ae) (2 — oxi) = 0, 


and hence we have no effective homographic relationship between az_; and ax. 
This example is not an unnatural one; indeed it is one of the first to which we should think 
of applying Theorem I; compare Theorem II and its applications to which a reference is given. 


1922 | THE ROOTS OF POLYNOMIALS 167 


smallest region of the kind already described for which z is a point of the 
locus Z. There is at least one of the points a; on C. We shall show that all 
those points can be made to coincide on C. 

Two points a; and a which are on C can be made to coincide either on C 
or within C. For we fix z and the remainder of the points {a;} and hence 
have a relation of type (2) between a; and a2. A double point of the trans- 
formation is either on the circle C or interior to the region C. 

Let us combine in this manner as many of the points {a;} on C as possible, 
so as to leave the smallest number of points {a;} actually on C. This number 
of points we denote by n; it is greater than zero and cannot be reduced. 

We shall prove by induction that all these n points on C can be made to 
coalesce on C. This fact is evidently true for two points, for no double point 
of the corresponding transformation can be interior to C. We assume ex- 
plicitly that whenever there are given n — 1 points of this sort on an are A 
of C, they can be made to coincide on A without changing any other point 
a;, or the relation (1); we shall prove this fact for n points. It involves no 
loss of generality to suppose n — 1 of these points at an end point X of A and 
the nth point at the other end point Y. 

We consider one of the points a, at X and the point a, at Y as connected 
by (1), while z and the other {a;} remain fixed. When a; moves on C, ap 
moves on C in the opposite sense, and the transformation (a1, a) has a 
double point D interior to A. Consider a; and a, to coincide at D. The 
new arc bounded by X and D contains the n points. We can make coincide, 
at a point X’, n — 1 points a;, X’ on the are XD, and the other point a, 
will be at D. Then we have the n points on an are A’ bounded by X’ and Y’, 
the arc A’ shorter than A and every point of A’ a point of A. Moreover, 
there are n — 1 points at X’, the end point nearest X, and the other is at 
Y’ = D, the end point nearest Y. 

We can continue in this manner successively to shorten our are A, and we 
can shorten it indefinitely. For let us suppose we have a sequence of arcs 


(using the natural extension of our former notation), XY, X’Y’, X”Y", ---, 


and suppose the points XY, X’, XY”, --- have a limit point z and the points 
Y, Y’, Y”, --- a limit point y different from x and of such a nature that we 


cannot make our arc of type A shorter than zy. We have n — 1 coincident 
points a;, successively approaching 2, and one point a, simultaneously 
approaching y. From the continuity of the left-hand member of (1) we can 
therefore suppose these n — 1 points a; to coincide at x and the other point 
a, to lie at y. Then the procedure formerly used shows that we can bring 
the n points into an arc shorter than zy all of whose points are points of zy. 
This contradiction completes the proof of our statement that the arcs A, A’, 
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A”, -++ shorten indefinitely, and hence the points a1, a2, «++, @, can all be 
made to coincide at the limit point P of those ares, with (1) still satisfied. 

If k = n, Theorem I is proved. If k > n, consider the transformation 
(a, a, ) defined as before. We have supposed that none of the points a1, a2, 
«++, a, can be moved from the circle C, so the transformation cannot contain 
a; and hence a, can be moved as near P as desired. Thus any other of the 
points n41, @n42, ***, a, can be moved as near P as desired without changing 
the value of f(z). Hence all these points can be made to approach P and so 
the value of f(z) is unchanged if all the a1, az, ---, a, coincide at P. 
Theorem I is now completely proved. 


II. SprecraL cases oF THEOREM I, WITH APPLICATIONS 
There are two results, essentially special cases of Theorem I, which are 
particularly interesting in their applications. 
TuHeoreM II. If the points ay, a2, +++, a lie in a circular region C and if 
z is exterior to C, the root of the equation in a 


lies in C .* 

Theorem II is in reality a special case of the extension of Theorem I where 
we consider the quotient of two polynomials of the kind described. The 
denominator polynomial cannot vanish, since z is exterior to C. 

For applications of Theorem II we refer to the citation already made and 
that made in the next following footnote. We proceed to another special case 
of Theorem I: 

TuHeoreM III. If the points a,, a2, «++, ax lie in a circular region C, the 
equation in a 


(4) (2 — a1) (2 — ae) +++ (2 — a) 


has at least one root in C. 

Theorem III can be proved independently of Theorem I in a manner precisely 
analogous to the proof of Theorem II (loc. cit.). This proof of Theorem III 
involves a transcendental transformation of the a- (or (2, y)-) plane: 


a-z=2+ et, 
and a study of the transform of C and certain centers of gravity in the (uw, v)- 


plane. ‘This proof gives in certain cases more detailed information than does 
Theorem I regarding the root a@ of (4). Thus if z is exterior to C, which is 


* Walsh, these Transactions, vol, 22 (1921), p. 102; Lemma I. 
Theorem II is closely connected with another more simple corollary of Theorem I, namely, 
that if k equal particles lie in a circle their center of gravity also lies in that circle. 


_ 
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the interior of a circle, we can write 
k log (z — a) = log (z — a1) + log (z — a2) + «+: + log (2 — ax), 


where all the logarithms involved have the coefficients of V— 1 in their pure 
imaginary parts lying between two numbers whose difference is less than 7. 
We add the remark, without proof at this time, that there is no other finite 
region bounded by a regular curve which possesses the property of the finite 
circular region indicated by Theorem III.* 

Theorem III can easily be extended to give some information concerning 
the location of all the roots of equation (4) ina. In fact, if a’ is any root, the 
other roots are given by 

= 
Z—-a=w(z-a’), 
where w is a kth root of unity. If we suppose that a’ is in C, all the roots of 
(4) must lie in the k circular regions obtained by revolving C about z as center 
of rotation through the angles 2j7/k,7 = 1,2, ---,k. In particular suppose 
one of these k circular regions, say C;, is external to all the other k — 1 circular 
regions. Then we can prove that C; has on or within it precisely one root of 
(4). For consider the points a1, az, -++, a, to move continuously always 
remaining in C and to coincide at the center of C. In this situation C; con- 
tains precisely one root of (4). The roots of (4) vary continuously with the 
points a1, a2, -**, a; none can enter or leave C,; during motion of the kind 
indicated; so in the original situation C; contained precisely one root of (4). 

However, our purpose is not primarily to study the roots of (4), but rather 
to use Theorems I and III as tools in proving more interesting relations. In 
preparation for these results we now prove the 

Lemma. If the interiors and boundaries of the two circles Cy and C2, whose 
centers are a; and a, and radii r; and rz respectively, are the loci of two points zy 
and z2, then as z, and zz vary independently, the locus of the point z which divides 
the segment (21, 22) in the constant ratio (m, : m2), 

Mo 21 + My 22 


(5) (m, — me Mm, Mz 0) 


is the interior and boundary of a circle C whose center is 


Ms + My Ae 
my + 
and whose radius is 
Mo Ty 


mM, + me + me 


* For the corresponding fact for Theorem II, compare Walsh, these Transactions, 
vol. 24 (1922), pp. 31-69; Theorem III. 
{ The terminology that z divides the segment (2;, z2) in the ratio (m, : m2) is usual when 
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If z; and z, are on or within C; and C, respectively, 


we have 


Me Oy + My _ ms ( 
+ Me + m2 


1 2), 


which is in absolute value less than or equal to 


Me Ty my Te 


m,+ Mme Mm, + me 


It remains to be shown that given any point z on or within C’, we can properly 
determine z; and z2. In order to do this, we merely place 


Mo 
mo(zi—a)_ M2 + m mi + ms 
+ Me = ( m, + Me Ms My 
my + Me my + | 
my, To 
my _ € M2 a + mM m+ My 
+ me mM, + Mme Ms My, Te 
+ my, + Me 


Then z; and 2 lie in the proper regions and satisfy (5), so the proof of the 
lemma is complete. 

This lemma gives no result if m; + m2 = 0; to treat this case we take (5) 
in the form 
(6) me(z— 2) = — m(z— 2), 


which is equivalent for our present purpose. If C, and C2 are mutually ex- 
ternal there is no point z different from the point at infinity which satisfies (6). 
If C; and C2 are not mutually external we may choose z; = z2, and every 
point of the plane satisfies (6). In the theorems below we give the general 
formulas for the determination of C with the understanding that when 
m, + m2 = 0, C is considered to contain no finite point of the plane or every 
finite point of the plane according as C; and C2 are or are not mutually 
external.* 

m,; and mz are real; whether m, and mz: are real or not, we simply understand that statement to 
mean that z is given by (5). 

This lemma was proved by the present writer and by the present method for m; and me 
real and positive, in a note in the Comptes Rendus du Congrés international, Strasbourg, 1920. 
See also Theorem II of the reference given in connection with the present Theorem II. 

* Thus, a careful statement of Theorem IV for the case A = 1 is 

TueoreM IVa. [If the poinis a,, a2, «++, ay have as their locus a circle C, whose center is a 
and radius r,, and if the points bi, be, «++, by have as their locus a circle C2 whose center is B 
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We now apply Theorem III and the Lemma to prove 

THEeoreM IV. If the points a,, a2, «++, ay have as their locus (the interior 
and boundary of) a circle C, whose center is a and radius r,, and if the points 
by, be, «++, by have as their locus a circle Cz whose center is 8 and radius ro, 
the roots of the equation 
(7) (2— (z— a2) (2— a) — A(z — (z — bg) (2 — = 0 


have as their locus the k circles with the respective radii 


ry + | 
|1 Auk | 


and centers 
an B 
(8) 
1 a Auk 
where A™* takes all the k values possible. Any one of these k circles which is 
external to all the others contains precisely one root of (7). 
It is evident that any polynomial equation can be written in the form (7), 
and in an infinite variety of ways. 
We prove Theorem IV as follows. Ifa point z is a root of (7), an application 
of Theorem III shows that 
where a and 6 are some points on or within C; and C, respectively. Hence 
we have 
z—az= A*¥(z—)b), 
oun 


where A‘ is some kth root of A, so z is on or within one of the k circles of the 
theorem. Variation of a and b independently and over C; and C; as loci shows 
that the k circles are the actual loci of the roots of (7). Continuous motion of 
the points a1, d2, +++, «++, bg so as to remain in their proper regions 
and radius rz, and if C; and C2 are mutually external, then the roots of the equation 
(7’) (2 — a1) (2 — a2) +++ (2 — ae) = (2 — di) (2 — be) (2 — be) 
have as their locus the k — 1 circles with the respective radii 
— 
and centers 
a — wf 
l1—w 
where w takes the k — 1 values of the kth roots of unity which differ from unity itself. Any one of 
these k — 1 circles which is external to all the others contains precisely one root of (7). 
If C, and C2 are not mutually external, the locus of the roots of (7') is the entire plane. 
The details of the proof of Theorem IVa are similar to those of the proof of Theorem IV 
and are left to the reader. 
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and finally to coincide shows in precisely the manner used in considering the 
roots of (4) that any one of the & circles which is exterior to all the others 
contains precisely one root of (7). 

Theorem IV reduces to the lemma when k = 1. 

A few remarks on the geometry of the situation are not out of place. The 
k points (8) all lie on the circle which is the locus of points z such that 


which circle is of course a circle of the coaxial family determined by a and B 
as null circles. Each of the & points also lies on a circle which is the locus of 
points z such that 


that is, the argument of the ratio (z — a)/(z — 8) is constant. These / 
circles belong to the coaxial family of all circles passing through a and 8, the 
family conjugate to the former coaxial family mentioned. These same & 
circles can be arranged in order so that at a and 8 each cuts its predecessor at 
an angle of 27/k. 

It is quite easy for us to obtain results concerning the roots of the derivatives 
of equation (7). The mth derivative (k > m 20), except for a constant 
factor, can be written 


(9) (z— aj) (z— ag) +++ (z — apm) 
— A(z— bi) (z— bb) (2 —bL,) = 0. 


The points a’, all lie on or within C, and the points 6’ all lie on or within C2, 
by the theorem of Lucas concerning the roots of the derivative of a polynomial. 
Hence all the roots of (9) lie on or within the / — m circles with the respective 


radii 


ry + | | 


Alm) | 


and centers 
All (k—m) 8 
{1/(k—m) 


where A’“-™ takes all the k — m values possible. These circles form the locus 
of the roots of (9). Any one of these k — m circles which is external to the 
remaiping k — m — 1 circles contains precisely one root of (9). This new 
result includes Theorem IV as the case m = 0. 


| 
| fllk 
iz— Bl A 
Z— a 
AVk 
z—- 
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We now state another theorem which results from Theorem III and which 
like Theorem IV refers to the sum of two polynomials.* 

THEeorEM V. [If the points ay, a2, «++, ay lie on or within a circle C whose 
center is a and radius r, all the roots of the equation 


(10) (z— a,)(z— a2) +++ —-A=0 


lie on or within one of the k circles which have the common radius r and whose 
centers are the points a + A*, where A‘* takes all the k possible values. If these 
k circles are mutually external, each contains precisely one root of (10). 

Of course any polynomial equation can be written in the form (10) and in 
an infinite variety of ways. The proof of the theorem follows the proof of 
Theorem IV. If C is the locus of a,, a2, ---, a;, the k circles form the locus 
of the roots of (10). 

Equation (10) is particularly interesting because the derived equation is 
independent of A. All the roots of the mth derived equation, m < k, lie on or 
within C. Theorem V can also be extended to a polynomial of the form 


l 


but this generalization as well as Theorem V itself is included in a more general 
theorem to be proved from Theorem I. 


III. SoME DIRECT APPLICATIONS OF THEOREM I 

We now proceed to derive a number of results directly from Theorem I 
instead of from Theorem II or Theorem III. 

If in Theorem I the polynomial f(z) contains merely one set of variables 
{a;} referring to a single circular region, the locus Z can have as its bound- 
ary no point other than a point of the curve traced by z when a = a1 = a2 
= +++ = q;, traces the circle C,. This leads to 

TuHeorEM VI. Let the common locus of all the roots of a polynomial 


f(z) = (z— a) (2 — ae) (2 — ax) 


be the interior and boundary of a circle C whose center is a and radius r. Then 


the locus of the roots of the polynomial 
(11) Aof (2) + Arf’ (z) + + Ana (2) + Ar f™ (2) 


is composed of the interiors and boundaries of the circles whose common radius is 
r and whose centers are the roots of 


(12) Ap (z—a)*+ (2 — + h(k—1) 2+ 1Ag. 


*If in Theorem IV we replace (7) by the sum of two polynomials not of the same degree 
and neither of degree zero, we are led to the determination of a locus which is not generally 
bounded by circles. Compare 4 of the next to the last paragraph of this paper. 
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Any of these circles having no point in common with any other of these circles 
contains a number of roots of (11) equal to the multiplicity of the center of that 
circle as a root of (12). 

A detailed proof of this theorem is quite simple. For any particular root 
z of (11) we consider the points {a;} to coincide at a onorinC. We have then 


Ag (z— a)* + (z — + k(kK—1) 2-14,=0. 


Since |a@ — a|=r, we know that z must lie in or on one of the circles de- 


termined. Conversely, any point z in or on one of these circles is a root of 
(11) for some choice of the {a;}; denote the center of the circle by 8. We 
need merely choose a; = a2 = +++ = ay = a+2-— 68, which is a point on or 
within C. 

If one of the circles C; has no point in common with any other of those circles, 
we may vary the a; at will and no root of (11) can enter or leave C;; these roots 
vary continuously if the a; vary continuously. In particular if 


we have the number of roots of (11) proper to C,. It is similarly true that 
a number of these circles C,, ---, C; having no point in common with any 
other of the circles contain a number of roots of (11) equal to the sum of the 
multiplicities of their centers considered as roots of (12). 

Theorem VI enables us to give a result concerning the location of the roots 
of the sum of any number of arbitrary polynomials; f(z) is to be chosen one 
of those polynomials of highest degree. 

If in Theorem VI we place Ap = Ap = = 4,+0, 
we have 

A circle which contains all the roots of a polynomial contains also all the roots 
of its derivative. 

This is essentially the theorem of Lucas: 

A convex polygon which contains all the roots of a polynomial contains also all 
the roots of its derivative.* 

The similar results obtained from Theorem VI by placing A; ¥ 0 but all 
the other A; zero (1 > 0) give the theorem of Lucas for the other derivatives 
of f(z). 

Theorem I applies just as well to the integral of a polynomial as to its 
derivative, but for a result of the nature of those just given it is necessary to 
choose a particular integral. Thus we shall prove 

Tueorem VII. Let the common locus of all the roots of the polynomial 


(z— a)(z—- ) ay, ) 


* In beth of these statements as well as many other similar results, the term contains may 
be interpreted to include the possibility of points on the boundary or to exclude that possibility. 
Whichever interpretation is considered in the hypothesis, the conclusion will be true under 
that same interpretation. 
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be the interior and boundary of the circle C whose center is a and radius r. Then 
the locus of the roots of the polynomial 


F(z) = 


is composed of the interiors and boundaries of the k + 1 circles whose centers are 
(1 — w)aand radii |1 — w\r, where w takes all the values of the (kk + 1)st roots 
of unity. Any of these circles which is entirely exterior to all the others contains 
precisely one root of F(z). 

If z is a root of F(z) for particular values of the a;, we must have for some 
choice of = a; = ag = = a, inoronC, 


cm 
= (l—w)a, 
where w*t! = 1. From this equation we have at once the desired inequality: 


lz —(1—w)al=|1 

which shows that z is on or within one of the k + 1 circles. On the other hand, 
any point z on or within one of these k + 1 circles is a root of F (z) for proper 
choice of the {a;}. In fact, if w is the particular root of unity corresponding 


to that circle we have merely to place @ = a; = a2 = +++ = a, where 


1—w 
the exceptional case w = 1 is trivial. 

The proof of the theorem is now complete except for the last sentence; this 
is proved in precisely the same manner as the corresponding statement in 
Theorem VI. Like Theorem VI, Theorem VII illustrates the remark made 
just previous to the statement of Theorem VI. 

We shall next prove further results concerning the roots of the derivative 
of a polynomial, making continued use of the Lemma. The special case of 
the Lemma where m, and mz, are positive is particularly simple; this special 
case leads to 

TuHeorEeM VIII. Denote by g(z) the polynomial 

(z — (2 — 22 )™ 
and by mf : mP (n = 1, 2, «++, m) the ratios in which the m distinct roots 
2™ of g™ (2) (the kth derivative of g(z)) divide the segment (21, z2).* Let the 
interiors and boundaries of circles C, and C2 whose centers are a, and az and radit 
r, and rz be the loci respectively of m, and mz roots of a polynomial f(z) which 
~ * The reader will easily prove from Rolle’s Theorem that no point z™ distinct from z; and 
Z2 can be a multiple root of g™(z). 
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no other roots. Then the locus of the roots of f{™ (z), the kth derivative of 
f(z), is composed of the m circles C™ whose centers are 


ay my” ae 
my” + 
and whose radii are 
my ry + my 
my? -+- 
If one of these circles C™ is exterior to all the others, it contains a number of roots 
of f(z) equal to the multiplicity of z™ as a root of g™ (z). 

The proof of this theorem is quite similar to the proofs of Theorems VI and 
VII and will therefore be omitted.* 

If a number of circles C™ of Theorem VIII overlap and are entirely exterior 
to all the other circles C™ , they contain together a number of roots of f (z) 
equal to the sum of the multiplicities of the corresponding roots z™ of g™ (z). 
A similar situation for one as for several circles arises in connection with 
Theorems VI and VII. In the future we shall indicate this general fact by 
saying that the number of roots of f (z) proper to a circle C™ is the multi- 
plicity of the point z™ as a root of g™ (z). 

In Theorem VIII we are in reality considering a polynomial 

f(z) = o(z)- 
where the m, roots of ¢(z) lie on or within C; and the mz roots of y (z) lie on 
or within C.. Then our conclusion refers to the polynomial 


When the result is expressed in this form it can be given a large extension. 
Under the same hypothesis with respect to ¢ and y we consider the polynomial 
(13) + Arg! YOY + YO? + + 
If z is a root of (13) for a particular choice of the roots of @ and y, there exist 
a and 6 in or on C, and C2 respectively and such that 
— 1) +++ (mg —k +1)Ao(z — a)™ (2 — 
+ my, + mo(me — 1) +++ (mg —k +2)A, (2 — (z — 


+ (my, —1)+ — 1) 


14 
) +++ (Mm, — + 3) Ae (z (z B 


+ (m —k +1)Ag (2 — (z — B)™ = O. 


* This theorem was published in a short note by the present writer, Paris Comptes 
Rendus, vol. 172 (1921), pp. 662-664, and its proof as there indicated contains the germ of 
the prodf of Theorem I. The special case of Theorem VIII for the case k = 1 had been 
previously proved by means of Theorem II; see the reference given in connection with that 


theorem. 
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Equation (14) is essentially an equation in homogeneous coérdinates 441, Me: 


mz (m2 — 1) (me —k +1) Ao * 
+ m,+ m+ — 1) +++ (me — +2) Ay 
(15) + my ( 1) * Me * (Me — (mM. — I: 3 ) Ao pen? 


+ m,(m, —1) ++: (my —k +1)Ag * = 0, 


n) 


whose distinct solutions, m in number, we denote by y{” : wy, n 
-,m. From (14) and (15) we see that for some n we have 


~ 


The Lemma immediately gives us certain circles in at least one of which must 
be located the point z. It is easily seen that these circles give the exact locus 
of z, and we have 
TuEeoreM IX. Let the interiors and boundaries of circles C, and Cz, whose 
centers are a, and a, and radii r; and rz, be respectively the loci of the m, roots of 
a polynomial @ and the mz roots of a polynomial Y. Then the locus of the roots 
of (13) is composed of the interiors and boundaries of the m circles C™ whose 
centers are the points 
BY? + 
and radu 
+ wy” | 12 tis 
+ | 
where p{” : wf” are the m distinct roots of (15). The number of roots of (13) 
proper to a circle C™ is the multiplicity of the ratio p{” : wy” as a root of (15). 
The methods that we have been using enable us also to obtain a result for 
the roots of the derivative of an entire transcendental function of the simple 


=1,2,-+-,m), 


type 
=e). Q(z) 


where 
P(z) = a( a0, 


Q(z) = +++ (2 — Ba), b#0. 
The roots of f’ (z) are given by the equation 
(16) Q(z) P’ (2) + (2) = 0. 


If all the a; and §; lie in or on a circle C whose center is a and radius r, and 
if z is a root of (16), we shall have for some @ and @ in or on C 


ap (2 = (z a)? q(z 
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Then we must have either 


or a 
ap(z -B)(z- a) +q=0. 
If this latter equation is verified we find from Theorem III that there exists a 
point ¥ in or on C such that 
ap (z 7)? + 0, 


where we = 1, 
ap 


It therefore follows that 


which is less than or equal to r in absolute value. Then all the roots of f’ (z) 
lie in C and the p circles of common radius r whose centers are the points 


g 


We state this fact and its obvious converse in 
THeorEeM X. Let the interior and boundary of the circle C whose center is a 
and radius r be the common locus of all the roots of the two polynomials 


P(z) 
Q(z) = b(z — Bi) (z — Be) 
Then the locus of the roots of the derivative of the function 
eP®) Q(z) 


is composed of C (unless q = 1) and the p circles whose common radius is r and 
whose centers are the points 


where w takes the values of the pth roots of unity. There are q — 1 roots of the 
derivative proper to C and one proper to each of the other p circles. 

In the proof of this theorem we have used not the fact that the interior and 
boundary of C is the locus of the roots of P(z) but that it is the locus of the 
roots of P’(z), which gives in reality a more general result. Thus if p = 1, 
there is no restriction whatever on the root of P(z). 

We now prove a theorem which is a generalization of our result concerning 
the roots of (16) corresponding to our former generalization of Theorem VIII. 


=0 
ap 
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THEOREM XI. Let the interior and boundary of a circle C whose center is a 
and radius is r be the common locus of the roots of the polynomials $1, ¢2, «++, 
of respective degrees ky, ko, +++,hq. Then the locus of the roots of the polynomial 


is composed of the circle C (unless nj; = k; fori = 1,2, «++, q) and the circles, 
ny + me +++ +n, in number, whose centers are the points 
. (ky 1) res (ky + 1) ho . ( ke 1) 
(ko — kg (kg —1) 
(hg — +1) A 


where w takes all possible values of an (ny + nz + «++ + nq)th root of unity, and 
whose common radius is r. There are 


(ki + he +--+ +hq) — (mi + me + +++ 


roots of (17) proper to C and one root proper to each of the ny + m2 + +++ + 1¢ 
other circles. 

Under the hypothesis of the theorem, if z is a root of (17) there exist points 
a’, a’, «++, a on or within C and such that we have 


(z — a’ — — — [hy (yy — 1) 
(ky —m +1) he- (he — 1) (he — me hg: (kg —1) 
nee (kg — Ng + 1) (z = a’ (z a!’ 
eee (z = 0. 
Then z must satisfy one of the equations 
(z — a’ )™ (2 — a!) — 
om ky (ky 1) (3, — Nq + 1)A = 0. 
If z satisfies this latter equation we know from Theorem III that there exists 
a point @ on or within C and such that 
(z — aq = — 1) (kg — mg +1)A. 
That is, we have the equation 
a— {a+ [ki (ki —1) --- (kg —mg +1) =a—a, 


where w"tt"z+""t™ = 1, The absolute value of the right-hand member is not 
greater than r, so z must lie in or on one of the mn; + m2 + --- + ng circles of 
the theorem. 

The detailed proof of the remainder of Theorem XI now requires no further 


analytical work and is left to the reader. 
Trans. Am. Math. Soc. 13. 
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The present paper aims merely to give some of the more immediate results 
that can be obtained from Theorem I. Further results can be obtained by 
extending in various directions: 

1. Application of these methods to other and more general types of 


polynomials. 
2. Extension of the results to other circular regions, namely, half-planes and 


the exteriors of circles. 

3. Expression of the results in a manner which shall be independent of linear 
transformation of the complex variable. 

4. Detailed determination and study of loci which naturally arise and which 
are of the same general nature as the locus determined in the lemma. For 
example, if the loci of a and 8 are two circular regions, determine the locus of 
the points z defined by 


For only the simplest polynomials f(z, a;, 8;) do we get a number of 
circular regions as the locus of the roots of f when the loci of the roots of the 
a; and 8; are circular regions. All the problems of the present paper have 
been characterized by a certain linearity, either the original problems them- 
selves or those problems after a simple transformation. In more general 
problems, where the locus of the roots does not ordinarily consist of circular 
regions, the exact locus should be determined and also a simple even if rough 
approximation. 

5. Application of these results and methods to the case of real polynomials.* 

The present writer hopes to return later to a consideration of these questions. 


* Thus the following theorem results from Theorem VI: 

Let all the roots of a polynomial f (z) of degree k lie in an interval of the axis of reals whose 
center is a and length !. Denote the roots of the polynomial 
(1’) + kAy + +++ + k(k —1) 
by 21, 22, ***, Ze, which roots are supposed real. Then all the roots of the polynomial 
(2’) Aof +Aif! +Asf™ 
lie in the intervals of the axis of reals whose centers are the points a +z; (i = 1,2, ++-,k) 
and whose common length isl. The number of roots of (2’) proper to each of these intervals is the 
multiplicity of the corresponding z; as a root of (1’). 

Harvarp UNIVERSITY, 
CamsBripce, Mass. (June, 1921.) 


A NOTE ON THE PRECEDING PAPER * 


D. R. CURTISS 


The results of Dr. Walsh’s paper are so interesting and capable of so many 
applications that it may be worth while to indicate a briefer proof of his funda- 
mental theorem which connects with earlier work in this field. 

First let us note that Walsh’s Theorem II is a form of Laguerre’s Theorem 
given below. In fact, if we write 


f(z) = (z — a) (z — ae) +++ (2 — ax), 


Walsh’s relation 


becomes 


or 


(1) 


Walsh’s Theorem II states that if the roots of f (z) are in the circular region C , 
and if z is exterior to C, thenaliesinC. This is precisely the result obtained 
by Laguerre as given on page 59 of volume I of his collected works, if expressed 
in non-homogeneous coérdinates as on page 57. 

We shall use another form of Laguerre’s Theorem (loc. cit., p. 57) to prove 
the underlying proposition on which Walsh bases his proof of his Theorem I, 
as follows: 

Every circle through any point z and its ‘‘derived point”? a as defined by (1) 
either passes through all the roots of f (z), or else has at least one root in the region 
interior to it, and at least one root in the exterior region. 

I have recently called attention} to the fact that this is a corollary of a 
theorem of Bécher’s on jacobians which has served as a starting point in 
Walsh’s earlier papers. 

With a slight change in Walsh’s notation, whereby we substitute z’s for a’s, 
the proposition from which Theorem I is deduced may be stated thus: 


* Presented to the Society April 14, 1922, under the title On the zeros of successive polars of a 
binary form. 
tScience, vol. 55 (1922), p. 193. 
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BY 
f(z) 
= 
f(z) 
os 
f' (z) 
|__| 
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Let z:, 22, «++, 2 lie within a circular region C, and let >-* denote the sum of 
all products of the z’s taken r at a time. Then the equation 
k(k —1) 


2! 


(2) + kay + + +++ +haiz +a, =0, 


where 
— a, Dita +a Fae Di 
has at least one root in C for each set of values ao, a1, +++, @—-1- 

This theorem is trivial if all the a’s are zero. In any other case we proceed 
by induction, assuming our proposition true for k = r — 1 and proving its 
validity fork = r. It is obviously true for k = 1. 

Let us place k = rin (2) andin (1). The resulting left-hand member of (2) 
we will use as f(z) in (1). We write 

f(z) = +4,, 
of’ (z) — f(z) = + (r — +4,], 
so that (1) becomes 
a, (r 1 27? + + a, 
+ (r + any 
Now consider the equation obtained by replacing @ in (3) by z,; cleared of 
fractions and arranged according to powers of z this is 


(4) (do + a) 27 + (r 1) (a, 2, + a2) 2"? + + (a,-1 2, + a,) 0. 


But if we compare this with (2), and note that 


— (G12, + = +a 
+a, +44 
— + zr + 41 (2r + 
+ de (ze + Hana (ze + Dr’) 
+ a) + (a1 2 + a) 
+ +++ + (G22, 


we see that (4) is a case of (2) fork = r — 1, and hence (4) has a root Z,inC. 

We now give z the value Z, in (3) and obtain the result that the “derived 
point” ais z,. By Laguerre’s Theorem every circle through Z, and z, either 
has on it a root of f(z) or has a root in its interior and one exterior to it; but 
Z, and z, are in C, so that we can draw through them a circle interior to C. 
Thus f(z) has a root in C, and on completing our induction we see that (2) 
has a root'in C. 
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The theorem we have just proved can not, however, be regarded as a new 
one. It is equivalent to the theorem of Grace:* 


If a form f (z) is apolar to a given form $(z), then it has a zero within any 
circle enclosing all the roots of @(z). 
If equation (2) and the equation 


(5) (z — 21) (z— 22) (2 = 0 


are made homogeneous, the condition that (2) and (5) be apolar is precisely 
the relation just following (2). Since this holds, Grace’s theorem requires 
that a root of (2) lie within any circle enclosing all the roots of (5), which are 
215 225 ° °°» Ske 

Grace’s theorem is a consequence of Laguerre’s when the latter is put in 
homogeneous form. If in (1) we replace a by £/n, z by 2/y, and f(z) by 
y* f(x,y), (1) takes the form 


(6) tf, + nf, = 0, 


which is the equation of the first polar of (£, 7) with respect tof(z,y). One 
form of Laguerre’s theorem (loc. cit., p. 59) may, then, be thus stated: 

If the point a = &/n is without a circle C containing all the roots of f, all the 
roots of the first polar of a with respect to f are within C. 

Consider a set of points 2;, 22, --+, z,, and let f; be the first polar of 2, with 
respect to f, let f. be the first polar of z2 with respect to fi, etc. If all these 
z’s are outside C’, the roots of all the polynomials f;, fo, --- , f,-1 are within C. 
If (2) and (5) are apolar, z, is the root of f,-1, and Grace’s theorem, in which 
f and ¢ may be interchanged, follows directly. 

The “mixed polars” f,, fe, --- are perhaps worth writing down in another 
form. Thus if we use the notation, for r < k, 


¥(z) = (z — 2) ( (z—2,) =eo2”? 


we have 


. 


r eee _ r f 


dy 


In non-homogeneous notation the equation f, = 0 becomes 


*The zeros of a polynomial, Proceedings of the Cambridge Philo- 
sophical Society, vol. 11 (1900-02), p. 352. See also Szegé’s paper, Bemerkungen 
zu einem Satz von J. H. Grace tiber die Wurzeln algebraischer Gleichungen, Mathematische 
Zeitschrift, vol. 13 (1922), p. 28, in which the “ Faltungssatz” is, in a sense, the com- 
plement of the theorem proved above. 
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dz dz™ 2 dz? 


r! 


If every root of f is within a circle C, and every root of y is without C , then every 
root of (7) is within C. 


NORTHWESTERN UNIVERSITY, 
Evanston, IL. 


DETERMINATION OF ALL GENERAL HOMOGENEOUS POLYNOMIALS 
EXPRESSIBLE AS DETERMINANTS WHOSE ELEMENTS 
ARE HOMOGENEOUS POLYNOMIALS * 


BY 


H. 8. EVERETT 


1. Introduction. Dicksont has shown that every binary and every ternary 
form, every quaternary quadratic, and a sufficiently general quaternary cubic 
form can be expressed as a determinant with linear elements, and that this 
property does not hold for any other form which is the general one of its degree 
and number of variables. The present paper is a generalization from the case 
of determinants with linear elements to that of determinants with elements of 
higher degrees. 

In the first part it is proved that if the number of variables is greater than 
three, no general form of degree rs is expressible determinantally, that is, as a 
determinant of order r with homogeneous elements of degree s, s = 2. The 
case of quadratic elements is considered in §2. The second part treats the 
determinantal expression of binary and ternary forms. In § 5 binary forms 
are expressed rationally as determinants. In §6 a general procedure is set 
up for ternary forms and the remaining sections show that this procedure is 
successful for ternary quartic and sextic forms. 


I. Forms with n> 3 ARE NOT EXPRESSIBLE DETERMINANTALLY 
2. THEoREM 1. When the number of terms in the general form of degree 2r 
in n variables (n = 2) exceeds [3n(n+1) — 2]r?+ 2, the form is not ex- 
pressible as a determinant whose elements are quadratic forms. 
Let D be any r-rowed determinant with quadratic elements homogeneous 
in the n variables x1, ---, 2,, which represents the general form of degree 2r. 
Its matrix M may be written in the form 


M Mu + My. + + U1 Ln Min + a3 Mo + + Mans 


where each M;; is a matrix whose r? elements are constants. We proceed to 
simplify the matrix M by the method of matrix multiplication employed by 


{ These Transactions, vol. 22 (1921), p. 167, hereinafter referred to as D. 
1D, p. 168. 


185 


Dickson.{ 
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Since D represents the general form, xj’ is present, whence the determinant 
of My is not zero. Thus M,, has an inverse M7! such that My, M7) is the 
identity matrix J. The new matrix 


N= xi v2 Nie Mes (Ni; Mi; M7) 


has as its determinant D/|M,;| and has unity as the coefficient of 27’. Next 
we choose a matrix B such that B-' Ny B shall have a canonical form P3.. 


Then our matrix reduces to 
P => B NB = x? I Xe Py. + + 2 Pus (Pi; Bo Ni; B). 


In contrast with the case of linear elements, we cannot show in our case of 
quadratic elements that the roots \1, ---, A, of the characteristic equation of 
Ny are all distinct. Hence we permit Pj. to have the most general form* 
with ¢ distinct \’s,¢ =r. Further normalization of P is effected by the matrix 
K commutative with Py. as defined by Dickson.t In AK Pi3 K we can 
specialize r — 1 elements. 

In case t= r no further specialization is possible. We have then, if we 
count the factor removed from D and the r distinct \’s of Py», 

1) —2)P—(r—-1) 

or [3n(n+ 1) — 2]r? + 2 parameters in the modified matrix of D. In case 
t < r the true number of parameters is less than that given above because we 
then have ¢ < r distinct \’s, and because further normalization is possible 
since the matrix A is not the most general matrix commutative with Py. 
Therefore, the number of parameters given above is the true maximum and our 
theorem follows. 

3. THEorEM 2. When the number of terms in the general form of degree rs 
in n variables (n = 2) exceeds 


' 


it is not expressible as a determinant whose elements are s-ic forms. 

The argument in $2 applies when D is a determinant of order r whose 
elements are forms of degree s = 2 in n variables, except as to the number of 
terms in the matrix M. This number is[n(n+1)--- (n+8—1)]/s!. 

4. ToeoreM 3. n> 3,8 22, the general n-ary rs-ic form cannot 
be expressed as a determinant of order r whose elements are n-ary s-ic forms. 


The number of terms in the general form of degree rs in n variables is 


')= (rs +1) (rs+n—1)-. 


rs 


(n—1)! 


* D, p. 178; Dickson, Linear Groups, 1901, p. 223; Bécher, Higher Algebra, p. 293. 
+ D, p. 179. 
t For s = 1 see D, Theorem II, p. 171. 
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Then by § 3 the general form of degree rs in n variables (n = 2) cannot be 
expressed as a determinant with s-ic elements if 


(rs +1)--- (rs+n—1) 


For n = 2 and n = 3, respectively, (1) becomes 0 > (r — 1)[(s —1)r—1] 
and 0 > (r — 1)[(3s — 2)r — 2], which hold for no value of r when s = 2. 


Now (1) holds if 


+1) (18+ 2) (28+ 3) g 


@) (n—1)! 


holds, where the left member of (2) has been obtained from that of (1) by 
replacing r by 2 everywhere except in the first two factors. We consider this 
relation first fors = 2. If n =7,r = 2, we can cancel from the numerator 


and denominator of the left member the factors 7, ---, (mn — 1) (which are 
absent if n = 7), and then (2) holds if 


(2r + 1) (2r + 2)n(n+1)(n + 2) (n+ 3) 
6! 
2| (2r+ 1) (2r+2), 


since the right member exceeds the value [n(n +1) — 2]r?+ 2 of S for 
s= 2. The last inequality holds if 


n(n+1)(n+ 2) (n+ 8)/180 = 4n(n4+ 1) 2; 
that is, if n(n — 7)(n+ 1)(n+ 12) + 360 = 0, which is obviously true 
forn=7. Forn=4,n=5,n = 6, respectively, (2) becomes 
(2r — 1)(2r — 2)(2r —3) > 0,7 (r — 1) (4r? + 24r? — 197 ++ 6) > O, 
(r — 1) (8r* + 68r? + 238r? — 107r + 30) > 0, 


each of which evidently holds forr =2. So(2)holdsfors = 2,r22,n24. 
We proceed now to show by induction on s that (2) and hence (1) holds for 
s$=2,r2=2,n=4. We assume that (2) holds for r= 2, n = 4, and any 
given value of s. We wish to show that (2) implies the corresponding in- 
equality with s replaced by s + 1, viz., 
[(s+ 1)r+ 2) (28 + 5) (28+n+ 1) 


—1)! 
(3) (n— 1) 


n(n+1)---(n+8) _ 


> S. 
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To derive (3) from (2) we must add to the right member of (2) 


| 


(s+1)! s! 
(+1)! 


and to the left member 


(28+ 5) (2a+n— 


(ar? + br te), 


where 

4(n — 1)s8?+ (n? + — 2(r? + 
b= 12(n — 2)8? + 3(r? + 5n — 10)8+ 3n(n4+ 1), 

S(n— 3)s+2(r?+n- 12). 


a 


To establish the induction we need 


(28+ 5) (28+ n—1)(ar+ br+c)/(n—1) 


(4) > 


Since the s-factors on the left are each greater than the corresponding s-factors 
on the right, and since br + ¢ > 0 for n = 3, (4) is implied by 


(9) a/(n— 1) = (s+ 2)(8+3)(8+ 4). 


Now (5) holds for n = 4 if it holds for n = 4 since the coefficients of s are all 
positive for these values of n. But for n = 4, (5) becomes 


21s? — 20s — 52 = 0, 


which obviously holds for s = 2. Our induction is therefore complete and 


Theorem 3 follows. 


Il. Binary AND TERNARY FORMS EXPRESSED DETERMINANTALLY 


5. Binary forms. Since the binary form of degree rs can be factored into 
rs linear factors, it is expressible as an r-rowed determinant each of whose 
diagonal elements is a product of s linear factors, and whose elements outside 
the main diagonal are zero. Dickson has expressed the binary form rationally 
as a determinant with linear elements.* Every binary form of degree rs can 
» be expressed rationally as a determinant of order r with s-ic elements: 


*D, p. 175. This determinant can be reduced by elementary transformations to the 
present one. 
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[ag +---+a,_, zy’ '+a,y'] 0 0 


[(— 1)’ y 000---¥ 


6. Ternary quartic form. The general ternary quartic may be given the 
form 


d+ f? — 2abed — 2abef — 2cdef, 


where a and b, c and d, e and f, are three pairs of bitangents of a Steiner set.* 
This function equals the determinant 


(ab — cd — ef) 2ed 
2ef (ab — cd — ef) 

The determination of the bitangents depends, however, upon the solution of 
equations of high order. In the case of the general ternary quartic form with 
no repeated factor Dickson’s determinant f of order r = 4 with linear elements 
can be reduced to a determinant of order r = 2 with elements of degree s = 2 
by the following elementary transformations. Multiply the third row and 
divide the fourth column by X4+ ¢4;2; multiply the second column and 
divide the first row by XY; + ez; multiply the first column by — z and add 
to the second column; multiply the fourth row by — z and add to the third 
row; multiply the second row and divide the second column by z. The 
resulting determinant is 


Ly le — 2? 

ly (¢39 ly — c422) — — Ugly — e432 
where 1]; = X;+ ¢;;2. No elementary transformations have been found 
which accomplish a similar reduction of Dickson’s determinant of order rs > 4. 

7. General ternary form. The remainder of this paper is concerned with 

a method of expressing a sufficiently general ternary form f of degree rs as a 
determinant of order r whose elements are s-ic forms. We may assume that 
f is irreducible. For, if f = fife where f; and fz are of degree r; 8 and rs 
respectively and expressible as determinants of order r; and rz whose matrices 
are M, and M2, then f equals the determinant of the matrix 


0 
O 


- * Miller, Blichfeldt, and Dickson, Finite Groups, 1916, p. 355. 
T D, p. 174. 
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where O is a matrix all of whose elements are zero. With Dickson* we choose 
a triangle of reference such that, for z = 0, f(a, y,0) = X, X2 --+- X,s, where 
X;= 2+ :yand \y, «++, Avs are all distinct. Then 


f= X, Xe Xvet x), 


where F;, is a binary form of degree rs — k. We shall attempt to express f 
as a determinant of the following special type: 


X, X,+ cee ZO1r 
X 541 Nos + ** Zher 


2Or1 ZOr2 X ave + 2Orr 


where each ¢;; is a ternary form of degrees — 1. There are} (rs +1) (rs+ 2) 
coefficients inf. In identifying f(2, y,0) with Xi, ---, X,., rs + 1 of these 
coefficients have been fixed. So the identification of D with f involves 
3rs(rs-+ 1) conditions on as many coefficients of the ¢;;. We therefore 
assign simple values to the remaining 4rs(r — 1) coefficients of the ¢;;; in 
-ach ;; (j > 1) we set the first s coefficients equal to zero with the exception 
of the first coefficient of each ¢;; (j = 1+ 1) which we set equal to one. 

We then expand D axially and consider the identification of the terms of 
this expansion, arranged in ascending powers of z, with the corresponding 
terms of any given form f. First, the terms linear in z will be identical with 
zF,(y, x) of f, where F; is any given binary form of order rs — 1, if they are 
equal for the rs values x = —A;y (= 1, +--+, rs). The resulting rs con- 
ditions involve the coefficients of the ¢;; of D. As a matter of fact, it will be 
seen in the following sections that they uniquely determine rs coefficients of 
the ¢;;. Next, the terms quadratic in z will be identical with 2 F2(y, x) of f, 
where F, is of order rs — 2, if they are equal fort x = —A:y (1 = 1, 

-, rg —1). The resulting rs — 1 equations involve coefficients of D. 
Proceeding in this manner to identify terms in z* (k = 1, ---, rs) we obtain 
a system of rs + (rs —1)+ --- +2+1= $rs(rs+ 1) equations in as 
many unknown coefficients of D. Since these equations involve arbitrary 
parameters F,, ---, F,,, and since by the implicit function theorem? solutions 
exist in the neighborhood of a point for which the jacobian of the equations 
does not vanish, the method outlined above effects the identification of D 
with f if the jacobian of these equations is not identically zero. 


*D, p. 172. 

t Or for any other choice of rs — 1 of the d’s. 

t Bliss, Princeton Colloquium Lectures, 1913, p. 7. Attention is called in this connection to 
MacMillan’s method of solution involving the introduction of a parameter and the expression 
of all solutions in power series of this parameter. See MacMillan, Mathematische 
Annalen, vol. 72, p. 180. 
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8. Ternary quartic form. A sufficiently general irreducible ternary quartic 


4 
f = Fi(y, x), A; =z+ dy, 


with distinct \’s can be expressed as the determinant 
(hora + ker y + X3 (hoot + hoo y + loo z)z 
The ten equations which arise in the identification of D with f follow. For 
brevity we write the frequently recurring expression \, — A» as (uv), and 
— Aw huy as ). 
(112) = Fi (1, — («= 1,2), 
| (1g) (27) (223) = Fil, — (7 = 3,4); 
(3¢) (40) + (112) (227) +25 (21t) = = 1,2), 
(13) (23) lee + (113) (223) + Ag (213) = Fe(1, — As); 
(112) loo + (227) liu = (212) he + Ai = F;(1, — (2 = 1,2); 
The first four equations are linear and determine uniquely ky, his, kee, he, 


since their determinant is 
Mz = mz = (31)? (41)? (32)? (42)? (21) (43), 


D= 


where 


1 
(31) (41) (32) (42) 
and is different from zero in view of the fact that the )’s are distinct. Since 
this system falls naturally into groups of 4, 3, 2, and 1 equations each, the 
Laplacian expansion of the jacobian of the system by minors of the last row, 
of the eighth and ninth rows, of the fifth, sixth, and seventh rows, and of the 
first four rows yields most readily the complete coefficient of the various terms. 
If the jacobian has its columns in the order heyy hi koe hoe kei, her 9 loo Li 
ley, Lig, we find on the main diagonal the only term of the expansion actually 
involving Its coefficient is 


Ms Mz Me My = (41)? (42)? (43) As, 


ma= (13) (23) (14) (24) 


where 
— 0 
M3; — 0 
As (13) (23)) 
We may assume that \; A: ¥ 0, since if one of the four distinct \’s is zero it 
may be taken initially to be Ay. Then since the d’s are distinct, the coefficient 
of lz; ke is different from zero and the jacobian does not vanish identically. 
9. Ternary sextic form, s = 3. A sufficiently general irreducible ternary 


(1 | 
J = Me = | 
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sextic 
6 
(6) f =X, X2--- > Fi(y, 2), AY; 
with \’s distinct can be expressed as the determinant D with cubic elements 
D= X, X34+ 26n 
X,X5 X6 + 


where $j = aj 2° + + cy + dij az + ej yz+fi2, and where in 
particular ay. = 1, bis = ¢2 = 0. The identification of D with f involves 
the following 21 equations. We use the abbreviated notations (wv) = Ay — dv, 
(uvew) = — Aw and [uvw] = dyy AZ, — Aw + 


= F,(1, —X,) (:=1,2,3), 
(1j) (27) (87) [227] = Fi(1, — Aj) (7 = 4,5,6); 


(42) (67) (112) + [1172] [222] — A? [217] = F.(1, (¢ = 1,2,8), 
(17) (27) (37) (229) + [117] [227] — AF[21j] = = 4,5); 
(47) (60) fir + [117] (227) 4+ [222] (112) 
— (211) — [217] (127) = 
(14) (24) (34) fos + [114] (224) + [224](114) 
— (214) — [214] (124) = Fs(1, — 4); 
[112] foo + [222] fir + (112) (227) — Ni for [212] fis 
— (127) (211) = Fy(1, (4 =1,2,3); 
(117) foo + (227) fin (127) for — (212) fie = F;(1, A; ) (2 = 
fir foo — fie for = Fe. 
With the columns of the jacobian arranged in the order ay1, 11, ¢11, 422, bee, 
C22, bor, C21, €22, doy, €21, dir, €11 for, far; dis, we find on 
the main diagonal the complete term of the expansion involving f3; bi: cde. 
Its coefficient is 
Ms Mz Mz My = (12)°(13)*(14)* (15)? (16)? (234 
X (24)* (25 )* (26 )? (34 )4 (35 (36 )? (45)? (46) (65), 


1,2,3), 


lI 


where 
1 
M.=-1, M= M;=|1 
As —Az 1 As 1 


1! |AG — 1 


— 3 1] — Xz 1 


| 
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We may assume that \; Az As ¥ 0, since if one of the six distinct \’s is zero, it 
may be taken initially to be \g. Then since the X’s are distinct this coefficient 
is different from zero and the jacobian does not vanish identically. 

10. Ternary sextic form, s = 2. A sufficiently general ternary sextic (6) 
can also be expressed as the determinant D with quadratic elements 


Xo + 2611 zh12 2913 
D=| X3 X4+ 222 223 
232 X5 X¢ + 


where Pij = hij x + ky + 2 and hy = hos = hig = kyo = keys = kos = 0. 
Let A be the determinant |/;;| of third order, and let Z;; be the minor of /;; 
in this determinant. In the following 21 equations which arise in the identi- 
fication* of D with (6) we use the abbreviations 


(uv) = — (uvw) = kuv — 


(32) (42) (52) (67) (117) = Fi (1, — (¢ =1,2), 
(1k) (2k) (3k) (4k) (33k) = Fy(1, — rx) (k = 5,6); 


(47) (52) (6¢) + (32) (47) (112) (332) 
+ (5i) (64) (112) (227) + (5i) (61); (217) = (i =1,2), 

(13) (23) (53) (63) lez + (13) (23) (223) (333) 

+ (13) (23)A3 (323) + (53) (63) (113) (223) 

+ (53) (63) (213)A3 = F2(1, — As), 

(1k) (2k) (3k) (4k) Is3 + (1k) (2k) (22k) (33k) 

+ (1k) (2k), (32k) + (3k) (4k) (11k) (33k) 
L = F,(1,—%) (k=5,6); 


(3¢) [ (112) — (310) + (382) 
+ (62) [ (110) le — he + (220) + 
+ (117) (227) (337) + (112) (327)A, 
+ (217) (33¢)A; + (317) = (4 = 1,2), 
(1k) (2k) [ (22k) — (32k ) los + (33k ) lee + Ax 
+ (3k) (4k) (11k) — (31k) hs + (33k) 
+ (11k) (22k) (33k) + (11k) (32k 
+ (21k) (33k + (31k) NE = Ae) (k = 5,6); 


< 


* See second footnote, §7. Here it was found advantageous to select the \’s in the order 
Ai, Az, As, As, As, As, Since this selection gives the minimum number of terms in certain equa- 
tions. Thus, the tenth and eleventh equations contain fewer terms than does the ninth 
equation. 
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(50) (62) Las + (30) (40) Loe + hs[ (21%) (320) — (227) (312) ] 

— (11%) (327) + (317) ] 

+ Is3[ (112) (227) + A; (217) ] 

+ [ (112) (312) lig + (327) + 

+ (330)[(117) (210) he + (220) ha + 

= Fx(1, — = 1,2); 
. Similarly for « = 6 with first term replaced by (16) (26) Zi; 


(110) Lar — (210) Lor + (312) + + (222) 
— (320) Lae + Ai Los + (330) L33 = (4 = 1,2); 


A= F,. 


With the columns of the jacobian arranged in the order ky, hy, hoo, hoo, 
kss, hss, hey loo , hs, kay hss Iss, hi Io le, los, hs, we find 
on the main diagonal the complete term of the expansion involving 
hi k33. Its coefficient is 


M, Ms Mz Mz Mz My = A5 (12)° (13)? (14)? (15 )* (162 
X (24)? (25 )* (26 (34) (35)? (36 )* (45)? (46 )? (56), 
where 
1 1 ri | 
M,=1, M,=— M;= 1 |, 
NG 


i — 3 
M,= 
— (15) (25) (15) (25) As 


M2 (16)(26) (16) (26)rg 


Ms = (13) (23) (53) (63) (51)? (61)? (52)? (62)? Ay Ae As Ag 
1 


M, = (31)? (41)? (51)? (61 )? (32)? (42)? (52)? (62)? (53)? 

63 )? (54)? (64)? . 

X (63)? (54)? (64) 1 


We may assume that ); Ao As Ag ¥ O since if one of the six distinct \’s is zero 
it may be taken initially to beA,. Then since the d’s are distinct the coefficient 
of this term is different from zero and the jacobian does not vanish identically. 

No attempt has been made to prove that for ternary forms higher than the 
sextic the corresponding jacobian is not identically zero. 
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GENERAL VECTOR CALCULUS * 


BY 


JAMES BYRNIE SHAW 


INTRODUCTION 


This paper presents results from various papers read before the Society 
during a period of several years. These are indicated in the footnotes. The 
calculus is independent of the number of dimensions of the space in which the 
vectors are supposed to be placed. Indeed the vectors are for the most part 
supposed to be imbedded in a space of an infinity of dimensions, this infinity 
being denumerable sometimes but more often non-denumerable. In the sense 
in which the term is used every vector is an infinite vector as regards its dimen- 
sionality. The reader may always make the development concrete by thinking 
of a vector as a function of one or more variables, usually one, and involving a 


parameter whose values determine the “dimensionality”’ of the space. The 
values the parameter can assume constitute its “spectrum.” It must be 
emphasized however that no one concrete representation is all that is meant, 
for the vector is in reality an abstract entity given by its definition, that is to 
’ in which the 
“operators” are generally not operators at all, since they have nothing to 
operate upon, but are abstract entities, defined by postulates. Always to 


“ec 


say, postulationally. The case is analogous to that of “ group’ 


interpret vectors as directed line-segments or as expansions of functions is to 
limit the generality of the subject to no purpose, and actually to interfere with 
some of the processes. It is sufficient to notice that in any case the theorems 
may be tested out in any concrete representation. The author desires to 
state that he discriminates between the terms function, values of a function, 
expansion of a function. 'The function is the law which enables one to ascertain 
the values. The table of values is not the function. The expansion is the 
system of coefficients of the basis functions by means of which we may have a 
particular representation of the function. Further a system of coefficients 
does not constitute a vector; such a system the author calls a multiplex. 

It is true that a multiplex appeals to many mathematicians as a sufficient 
entity to serve as vector, but a multiplex does not specify a vector till the basis 
(or in geometry the set of axes) is specified. A system of coefficients does not de- 
fine the same function when attached to powers of 2 as when attached to sines in 
a Fourier series; the vector (1, 2,3) has no meaning till the directions for the 


* Presented to the Society at various meetings, 1909-1921. 


Trans. Am. Math. Soc. 14. 
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numbers are assigned; and 1 — 2? is the same function whether expanded in a 
Fourier series or a Bessel series, though its multiplex is different. The essence 
indeed of a vector calculus lies in its ability to produce invariantive and co- 
variantive expressions, and it does this because it uses vectors directly and not 
some particular representation of the vectors. A calculus which produces 
formulas in x, y, z, even though they may be at once replaced by 2’, y’, z’, 
and preserve the same formulas, may be an invariant calculus, but it is not a 
vector calculus. At the best, symbols which stand for such expressions referred 
to axes for their meaning are merely short-hand symbols. 

The method developed here has been found by years of use to be flexible, 
smooth-running, simple, and to include all the systems of vectors now extant. 
Its expressions are all covariantive for the simple reason they have no depend- 
ence upon any system of reference. They are compact and readily interpreted 
in such applications as differential geometry. It furnishes the basis for integral 
equations, and the general theory of distributive operators. In the abstract 
the intention is to create a calculus of symbols which are defined by the rules of 
the calculus. Any entities that submit to these rules may be represented by 
the symbols so far as this calculus is concerned, though from other points of 
view they must be treated differently. These symbols stand for entities which 
are subject to certain processes. A process is not a function; it produces 
entities which are functions of the entities subject to the process, but the process 
is not the function. For instance we use a process called addition, and one 
called multiplication. ‘There are several called “A” processes which are not 
“multiplications.” The author is aware that some mathematicians like to 
call some of these A processes by such names as inner multiplication, outer 
multiplication, indeterminate multiplication, and the like, but he has not yet 
observed that this multiplicity of names has been any advantage in the develop- 
ment of the subject. 

On the side of vectors in finite space (as to dimensions) this method includes 
the Clifford algebras,* and might be supposed to be identical with the general 
treatment of Clifford. The question has been asked: Why not assume a system 
of units, i;, 12, +++, in, Whose squares are — 1, and such that 7, = — ig tp 
when p # g? All the formulas in the finite cases may be worked out from this 
point of departure. They might then be generalized to the case of a denumer- 
able infinity of units. This would indeed at least simplify much of the extant 
work on the uses of orthogonal functions, vectors in a “ Hilbert space,” ete. 
But when we take up the most general case of a non-denumerable space it 
would fail. The present development has no reference to any dimensions. 


* Consult Clifford’s Works; Joly, Proceedings of the Royal Irish Acad- 
emy (3), vol. 5 (1897), pp. 73-123; vol. 6 (1900), pp. 13-18; M’Aulay, Proceedings 
of the Royal Society of Edinburgh, vol. 28 (1908), pp. 503-585. 
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This must be well understood by the reader, for it is fundamental. The 
Grassmann treatments are hampered by their assumption that the “outer 
product” of all the basis units of the system is a numerical value. Without 
knowing the number of dimensions of the space the formulas cannot always be 
written. The formulas here are equally valid whether the vectors lie in a space 
of three, three thousand, a denumerable number; or a non-denumerable 
number of dimensions. There is no basis, for every vector is an entity unique 
in itself. It defines with its numerical multiples a linear space, any two define 
with their numerical multiples and under addition a binary space, any n an 
n-ary space,—under certain restrictions to be noted. 


FUNDAMENTALS 
1. DEFINITIONS 

1. Vectors. Vectors are represented in notation throughout by small Greek 
letters, or by expressions built up with small Greek letters, and other symbols. 
Capital Greek letters invariably mean operators. Roman (italic) letters mean 
numerical values selected from a given domain, or field. An exception some- 
times is furnished in using z in the ordinary sense. 

Vectors are defined by stating the conditions to which they are subject. 
These are as follows. 

(1) Each vector gives rise to a set of vectors, called its multiples, repre- 
sented by writing the symbol of the vector and a symbol for a number 
(or mark) chosen from the given field, thus from a we have aa, 3a, ta, and 
the like. No distinction is made between aa and aa. The multiples of a 
vector are as numerous as the marks or numbers of the field. Any one of the 
multiples may itself be taken as the initial vector and the others, including the 
original vector, are then multiples of it. No vector is a unit vector absolutely, 
but only relatively. 

(2) In case we have one more vector which is not among the multiples of the 
vector a, say 8, then the multiples of a and the multiples of 6 enable us to 
form a binary complex of vectors, indicated by za + y8, where x and y inde- 
pendently assume all values in the field. The vector £ = xa + yf is a unique 
vector, but is said to be dependent upon a and 6, or not to be linearly inde- 
pendent of them. We call the process of forming these vectors of the complex 
addition of vectors. We assume as a matter of definition that 


E=2a+ y8B = 2a, 


and further that, in the case of still a third vector, not in the complex of a, 
and B, say y, if =za+y8,6 =yB+27,0 =E+27,7 =2a+ 6, then 
in all cases ¢ = 7. 

This is called associativity of addition. 
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(3) If & = >> an an, then the multiples of £, as cf, are also to be formed 
from the same multiples of 2 a, 2% a2, ---, that is 


ck = etn Qn. 


We see that by this last condition 
sat yates 


(4) If 0 is included in the field, then we have vectors 0a, 08, --- and these 
are indistinguishable. 

It follows that we may consider a + 0£ = a@ in all cases. Further we see 
that the multiples of a, 8, y, ete., which determine a complex are to be con- 
sidered as in the complex. Hence the vectors which determine a complex are 
not unique. Whatever vectors are used to determine a complex are called a 
basis for the complex in case they constitute an irreducible set, in the sense that 
no one of them is included in the complex of the others. That is the same as 
saying that a,, a2, «++, a, form a basis for a complex in case 


Op = + Ae + + Qn 


only for x; = a% = +++ =a, =0. They are said then to be linearly in- 
dependent. 

(5) When the number of linearly independent vectors is not finite we 
indicate them by using a parameter attached to the vector symbol, sometimes 
as a subscript, as in the case of a denumerable infinity, thus a;, a2, +++, Ge} 
and sometimes as a parameter in a parenthesis, as a(7), as in the case of a 
non-denumerable infinity. ‘The complex in the first case is indicated by 


E = an On or at, a(n). 


The complex in the second case is indicated by 
= xr(ija(i)di. 

As an instance of each case, so far as our conditions go we may consider that 
functions of a single variable are vectors. We would then have, using z for the 
variable, 7 for the parameter, in the first case, an instance from expansions in 
orthogonal functions, 


E(z) Dd an on (z); 


in the second case, an instance from the expression of a function as a definite 


integral, 


£(z) = 


a 


| 


1922] GENERAL VECTOR CALCULUS 199 


When we apply vectors to functions we need to note there are further restric- 
tions on account of the problems of analysis. We are concerned here largely 
with the problems of algebra only. 

When we undertake to find the coefficients (from the field) which would 
enable us to express a given vector as belonging to the complex of a given basis 
(finding the components of the vector), in case it is possible to express the vector 
thus, that is, in case the vector does belong to the complex, we say we resolve 
the given vector on the given basis. This is exemplified in the process of finding 
components of velocity, or force, or vector field, or Fourier coefficients, or 
solving a linear homogeneous integral equation. The conditions, in the func- 
tion case again, that such resolution may take place, have been extensively 
studied in certain special eases. The study of such conditions as result from 
these analytic problems has been carried far by E. H. Moore, whose papers on 
General Analysis may be referred to. The definitions of norm, Hermitian 
square, generalized theorem of Pythagoras, etc., find their place in such 
investigations. 

(6) Vectors are further subject to conditions which are the main object of 
the present paper. These result from the formation of vectors of grade 2, 3, 

--, and their combinations into complexes, and combinations into complexes 
of mixed grades. These will be treated separately under the heads: the accretive 
process, the decretive process, the Hamiltonian process. The latter will also be 
valled the product. 

2. ‘THE ACCRETIVE PROCESS 

2. Alternants.* In general an expression is alternant when it changes sign 
with the interchange of two of its elements. The usage here is not contra- 
dictory to this but supplements it considerably. As an example of an alternant 
let us consider n functions of n different variables, giving us n? different func- 
tions 

a; (8;) (¢,j7=1,2,---,n). 


Then the expression + ai, (81) aj, (82) +++ (8, ), where the subscripts 
are the numbers 1, 2, --- , m in some permutation, all permutations occurring, 
and the sign is + or — according to the number of inversions, is an expression 
which changes sign if any two subscripts are interchanged. It is therefore an 
alternant. 

The alternants we introduce are defined by the following conditions: 

(1) From any n vectors we may construct an alternant, written A, - a; as 

- a, called a simple vector of grade n. 


* The remainder of part I contains the results of papers read before the Society as follows: 
Chicago Section, December 26, 1913, The two fundamental operations of general vector analysis; 
December 31, 1909, On Hamiltonian products; Southwestern Section, November 27, 1909, 
Scalars of lineal products. 
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(2) The interchange of any two vectors changes the sign. 
(3) If any one of the vectors is in a complex, say a; = ><a; 8; where the >> 
may be used also to include the non-denumerable case 


di, 


e 
then 
Ay * On Ap = aj An a By One 


It follows at once that if any two vectors are equal the alternant vanishes, 
that if any vector is in the complex of the others the alternant vanishes, and 
that to any vector may be added any vector in the complex of the others 
without altering the alternant. The distributivity property (8) is very 
important. 

(4) 4)-a@=a. Thealternants begin with 4,. We preserve the notation 
since it is valid in general theorems. 

(5) If Ay ay +++ = Othen for some set of values 21, 2%, 2, not 
all zero 

+ v2 Ae + + tn An = 0. 
This makes either equation a necessary and sufficient condition for the other. 
It precludes using as the alternant defined above any alternant expression. 
For instance in quaternions we might use as .42-a 8 the expression 
V - 0(a@)0(8) where @( ) isa linear vector function, but it would be necessary 
to choose 6 as non-singular. Otherwise the expression might vanish because 
a or 8 happened to be an invariant line for the root 0. Similarly we might 


take as -l. - a8 the expression 
b 
| [a(s)B(t) — B(s)a(t)]o(s)dsdt, 


but we should have to place restrictions upon what ( ) could be. Indeed in 
either example we can consider that we do not really have the simple alternant, 
but a function of the alternant, for they may be written respectively ®V - a, 
and ®[a(s)8(t) —B(s)a(t)]. The function might vanish when the 
elementary alternant does not. We limit the meaning of A, - otherwise, 
however, only by the postulates. 

3. Vector of grade n.* A basis of alternants of grade n gives a complex of 
grade n, and any member of the complex is a vector of grade n. A vector of 
grade n is indicated by V,,. It does not have to reduce to a single alternant, 
as for instance As - a8 + As - yé6 may or may not reduce to the form Ae - ef. 
Many vectors of grade n do however reduce to single terms which are repre- 
sented by the symbols A,. Such is the case to be defined, which is written 


* This is not the “ polyadic’’ of some authors. 
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An *** Where k #0. Such expressions have properties to be 
proved which may be stated in terms of the permutation groups under which 
they are invariant. In case k = 0 we might say that A, - a; a2 +++ a, Is 
invariant under the alternating group of permutations of its vectors, and 
changes sign for the remaining substitutions of the symmetric group. Other 
expressions constructed from an initial term by applying the substitutions of 
some group would be interesting and useful, but little study of them has 
occurred. 

4. Accretive process. The accretive process is now defined by the equation 


Apig * Ap * 1 Q2 *** Gy Ag Bi Bo By = *** Ap Bo; 


where p,q, are any integers. Hence if the alternants are interchanged the sign 
is multiplied by (— 1)?¢. 


3. THE DECRETIVE PROCESS 

5. Scalars. The term scalar was first applied by Hamilton to quaternions 
which could be included in the ordinary number scale. It is used here in the 
same sense but with the further application to a process which gives numbers 
(marks) belonging to the field, from vectors, whatever the grade of the vectors. 
We shall use Ao - to indicate such results. We define first: 

(1) Ag: An + @ Qn An Bo B, is a number or mark of the 
coefficient field. 


(2) Ag *** Ay Ay By Bo Bn 
This will be found to be a special case of a more general formula. In particular 
Ay - aB = Ao: Ba. 
(3) do: Ap ()4,() = 0, 
(4) An - 
Ao PR + Ao PS + + Ao OR + Ao QS 


for any expressions P,Q, ---, R,S,-+-+. That is, Ao is distributive. 

Instances of the scalar process are found in the expression called inner product 
by some mathematicians. In geometric vectors it is usually defined to be the 
product of the lengths of the vectors by the cosine of their angle, which angle is 
defined for the space involved. In functions we have 


b 
Ag: aB = f a(s)B(s)ds. 


We do not identify the scalar process A» with the scalar process in quaternions, 
though the latter is included. For we might take Ap - a8 = Sa@(8) where 
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6() is a self-transverse (symmetric, self-conjugate) linear vector operator 
which has no zero roots. Such a form is implied in the integration with a 
matrix defined by E. H. Moore in his General Analysis. 

Sealars of vectors of grades higher than one are expressible by scalars of 
vectors of grade one. 

6. Orthogonality. If the scalar of two vectors of any grade vanishes the 
two vectors are said to be orthogonal. A system of vectors may be mutually 
orthogonal, as for instance a system of mutually perpendicular vectors in 
space, or a system of functions such as a; = V2 sin irs, under the definition 


*1 
Ag: aaj = J a; (s) a; (s)ds. 
0 


Biorthogonality. Two systems of vectors a(i), 6(j) are mutually bior- 

thogonal in case we have 
Ay: a(i)B(j)=0, 
Ap: a(t)B(7) #0. 

Biorthogonality and orthogonality play an important part in the problem of 
expansions mentioned earlier. For instance they enable us to find the coeffi- 
cients under proper restrictions. 

7. Decretive process. We are now in a position to define the general de- 
cretive process. We define first the expression 

An-1 aA, By Bs Bn and ( = Ao Ay-1 Bi Bn 
1 
where the 7 written under {; --~- 8, means that 8; is missing from the expression. 
That is 6, --- 8, means --+ Bis1 +++ Ba, with the particular cases 
+ 
B, By, means --- B, and 8, --- 8B, means 8, This expression 
1 n 
is evidently a vector of grade n — 1. The decretive process reduces the grade. 
We define further 

We shall see later that we are here following a general law of permutations 
which is the same as the determinant law for change of sign, but we have to 
notice that in A» the two vectors may be permuted without change of sign. 

If we iterate the process just defined we would have 
An-2 A,-1 * A, Bi Bn 


This is evident since the multiplier of A,-2 - 8; - --+ - Bn will come from two 
J 
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sources; being, if we omit the signs, one way Apo + a2 8; + Ao+ a: 8;, and the 
other way Ao: a2 8; a1 8;. But if < j the sign for the first is (—)**’** 
and for the second (—)**’**, hence they have opposite signs. It is easy to 
see by induction that for r iterations we have 


The formula for operation on the right instead of the left is easily written. 
As an instance in functions, we may define A, - 61--+ 8B, to be the 
determinant 


+ Bi (81, ) Be (Sh, ) Bn 


a function of n independent variables. If now we multiply by a: (s), we 
may produce the decretive process by forming the sum, with proper sign, of the 
integrals we have, when we set in turn s = 81, 8 = 8, °++,8 = 8p. 

Since the vectors a enter the expressions above in an alternating manner 
we are warranted in the final definition, for the decretive process on two 
vectors of any grades, 

Ay+* a a, A, Bi Bn 
and we can show that 

An-r* An+ () () = (— An when r 
In either case if r = n we have the important formula 

| / 
Ao An Gn An * Bi Bn = Ao a; B;| 
The expression just found is sometimes called the Grammian when the a’s and 
B’s represent functions. In this case we define 


Ag: a8 = ds. 


For n functions the vanishing of the Grammian is a necessary and sufficient 
condition of their being linearly dependent. However, this is really because of 
the presence of the A, - of the n functions. It is really what vanishes. It 
is evident now that we may rewrite the definition of the general decretive 
process in a theorem 


th, 
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It is to be observed that the sign is to be obtained by the determinant rule, 
since we pass §;, over 7; — 1 preceding 6’s, 8;, then over 72 — 2 preceding 6’s, 
ete. The total inversions are then 7; + 72 + --- +7,+4r(r+1). This 
theorem is a particular case of a more general one to follow, which, however, 
is to be stated as a definition of both the accretive and the decretive processes 
for all cases. 

The expression Ag - An () An () is called the norm or square of the intensity 
of the vector A,, as for instance Ap - Ae - aBAe - af is the square of the in- 
tensity of Az - a8. Analternant divided by its intensity, if this is not 0, gives 
a unit alternant. 

By means of the two processes it is possible to construct the whole of a 
general vector calculus. * 

8. When Ag+ An ai An: Bi 0, we know from the prop- 
erties of determinants that if the rank is h, by proper addition of rows and 
columns we may reduce the determinant to one in which either n — h rows or 
n — h columns contain only 0, which in the present determinant will give 
vectors in these rows or columns that are in the complex of the vectors a or 
the complex of the vectors 8, respectively. It follows that n — h vectors in 
the complex of 8’s may be chosen so as to all be orthogonal to every vector a, 
or we may choose n — h vectors in the complex of the a’s which are all orthog- 
onal to all the vectors 8. It follows that the complex of the remaining h 
linearly independent vectors in the complex of the a’s, and the remaining h 
linearly independent vectors in the complex of the 8’s, define two complexes of 
h dimensions, which have a common incident complex of h dimensions. We 
must not assume that the complex of order h of the vectors a can be identified 
with that of order h of the vectors 8, since each vector of the first may be of 
the form S°} ai yi + &,j = 1, +++, 4, and the vectors of the other complex of 
the form yi yi + = 1, +--+, h, where the vectors ¢ and are orthogonal 
to all the vectors y, and all vectors ¢ are orthogonal to all vectors &. The Ao 
process leaves only the bilinear homogeneous forms }°27; y;. It follows that 
the h vectors a are not necessarily expressible as in the complex of the h vectors 
6, and vice versa. We may say then that the complex of the n vectors which 
determine a vector of grade n, and the complex of the n vectors which deter- 
mine another vector of grade n, may have with regard to each other a rank h, 
which is the order of two sub-complexes, one from each complex, which have a 
common sub-complex of order h. In the geometric case it is easy to see the 
significance of these facts. In the function spaces, however, they are also sig- 
nificant. When the general decretive expression vanishes, each scalar coefficient 
vanishes provided the vectors 8 are linearly independent. Hence all vectors 
a,and 8 become subject to the remarks just made. 

* This was presented to the Society in a paper read March 26, 1921, On Hamiltonian 
products, second paper. The contents are incorporated in this memoir. 
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9. In the alternant A, - a; --- a, we may substitute other vectors without 
changing the value of the alternant. These may be chosen so as to form a 
mutually orthogonal system if their intensities do not vanish. For it is evident 
that 


is orthogonal to a, and its substitution for a will not change the value of the 
alternant. We may now find a vector orthogonal to both a;, az to substitute 
for a3, ete. In short if we set 


= a1, 
/ 
Q2 Ay ay Ag a2/Ag a, a1, 
as = Ay A» * a2 Az * Ae a3/Ao As» 


we shall have A, a1 +++ = An+ ay 
The process would of course break down in case 


Ag+ aja, = 0 Ag+ Ao + a1 Ap + a2 = O, 


etc. But these are singular vectors and are excluded from the cases that are 
under consideration. This process of orthogonalizing vectors is of importance 
in the study of sets of functions, integral equations, etc. If the Ao process is 
defined as in the General Analysis of Moore, so that it is a Hermitian product, 
the vectors may always be orthogonalized. 

10. In case p is linearly expressible in terms of 81, --- , 8,, since we have for 
any set of n vectors a1, «++, @ (writing merely @ for a; a2 «++ a, and 6 for 


Bz Bn) 


Ay An . Bp =0= (Ao An aA, B)p 
= (Ao A, aA, By Bn—1 Pp) Bn + 


it follows that we have a means of expanding p in terms of the vectors £;, Be, 
+++, By if Ag - A, - aA, 8B 0; that is, if the set of a’s has rank n with 
regard to the set of 8’s. 'Transposing we have 


Bn Ao A, aA, : 


Ao An aA, By Bn p 
Br—1 Ao A, aA, B + 


We now set 


8* is called the complement of 8; as to the vectors a. In case the vectors a 


Apo Ae 

as — ay 

Ao 
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coincided each to each with the corresponding vectors 6 , then the complements 
would be called the adjuncts of the vectors 8, that is, the adjuncts are com- 
plements in the complex of the 8’s themselves. Evidently 


Ao B; bij, 
where 6;; is 1 if? = 7, and Oifi #7. The expression of p now becomes 
Di Bi Ao B* p = iB Ao Bip. 


This is the generalized form for the expansion of p commonly used in 
quaternions. 

The process used above for two complexes is called biorthogonalization of 
the vectors 8 as to the vectors a. The complexes need not be the same, but 
must have their rank with regard to each other equal to the order of the com- 
plex. That is to say, the vectors a must be expressible linearly in terms of the 
common incident complex and other vectors orthogonal to the incident com- 
plex. The same is true for the vectors 8. The vectors outside of the incident 
complex, however, may be quite different. As an example we may show that 


the sets of vectors 


Bi, Ay + a Ae + Bi Bo, A; + As + a a2 Ag Bi Be Bs, 
Ag + a3 Ay By Bo Bs Bs, 


A, + By As + a1 By Bs Ag + a1 a3, 
are biorthogonalized sets. 
11. We may now give the complete definition that states the result of using 
either the accretive or the decretive process on two alternants* (counting 
A, + £ as an alternant by definition). It could not have been stated before 


since some of the expressions on the right would have to be defined first. We 
define for all 7, 7 and any vectors a, 8, 


Ags an a Aj Br B; 


X (Ao+ Ag+ ai, +++ A Bj, Bj,) 


where ¢ we have k Si, k Sj, that is,i+j — 2k = |i—j|. The sign may be 
determined directly without using the factor containing the sign, by counting 
the total number of inversions, provided however that in the scalar factor 


()() the vectors a@ precede the vectors 8. This definition includes all 


Ag: 

* No special name is needed, as the next division shows. 

+ This X sign merely means that the expression should be written on one line. 
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the others given, as particular cases. In case k = }i, i Sj, and 8;, = a;,, 
ig = 1,2,-++,72, the expression vanishes. In case k = 0 we have but a single 
term, the sign being +. If k = 0 we have the accretive process. If the 
order of the two alternants is changed it is evident that the result is multi- 
plied by 

The vector just defined, of grade i + j — 2k, is the alternant which defines 
the region or space of order i+ j — 2k containing all the regions of the 
alternants of order i and j determined by omitting common incident regions 


of order j. 


4. THe HAMILTON PRODUCT 

12. Hamilton product of two vectors. Let V,, and V» be any two vectors of 
grades n, m respectively, that is, in the complex of a system of alternants all 
of grade n and the complex of a system of alternants of grade m respectively. 
Then we define Vingn—2k*V im Vn =D. Amin—2k* Yn, Where Vm =X 2m An, 
and V, = Sy, Az. This merely states the distributive law. We now 
define the Hamilton product (also distributive) for two vectors of grades m, n, 
indicating it by the sign * temporarily. We shall see later that no special 
mark is needed. We remember a scalar is not a vector, though written Ao. 


Via Vn = O<m,0<n, = 2* Van. 


From this definition we have first, since A; - a1 = a1, 
ae = Ao * Ae + Ao * Qe. 
Since the two parts of this product are of different grades, there is nothing to 
interfere with our using the same symbols in front of the Hamilton product as 
selective symbols to indicate the different parts. That is, we may use 
Ao ay * as and Ao Q1 Qo; 
As ay * ae and Ag * Ae 
indifferently. We shall see that this is a procedure which is general. 
Again we have 
ay * Ao a2 = Ay a a2 a3 + Ae 
= a3 Ao a1 G2 — a2 a1 a3 + Az + G3. 
Since a; * Ao a2 a3 = Ao a3, by adding, we have 
a, * ( a2 * a3) = a; a2 a3 — a2 Ap: a1 a3 + a3 Ag+ a + Ag * Ae 
This product consists then of two vectors, one of grade 1, the other of grade 3. 
If we use A; and A; in front of a Hamilton product as selective symbols, then 
we may define 
a, * ( a2* as) = a; Ao: ae a3 — Ae Ao * &3 + a3 Ag M2, 
Az + (a2* a3) Ag 


208 J. B. SHAW [ October 


Evidently 
Ay a* (a2* a3) = A, - a3*(a2*a). 
We have 


ay * a3) = Ay a1 * (a2* a3) + Az = * 03. 


The vector of grade 3 in this product is familiar. The vector of grade 1, how- 
ever, is a new function of a,, a, and a3. We may indicate it as a function of 
@,, @2, @3 by using the same symbol, omitting the multiplication sign*. The 
alternants we have already written in that manner, the mere juxtaposition of 
the vectors in A, - a; +++ @, not implying up to this point anything in the 
nature of a product. It has meant so far nothing more than a certain defined 
function A, - (a,, a2, -++, a) of n vectors, of first grade. Hence we will 
now define a new system of functions recurrently: 

+ An Angi * G2 
whereO =7. (Thelast term vanishes when = 0, since Qn = O, 
forn+1>n.) 


For instance 
Ay Ae a3 = A, Ap Q20a3 + Aj Ae A3. 


In the case of functional space the number 7 indicates the number of integra- 
tions we perform, there being in all n + 27 parameters which are set equal in 
i pairs in every possible manner, and then each pair integrated, leaving in 
every resulting term m parameters as indicated always by the subscript of 
the A. We see that now A has ceased to have only an alternating significance, 
retaining that meaning when the number of vectors following is the same as the 
subscript on the A. But when the number is greater (it never can be less), then 
A no longer is an alternating sign. The functions, however, do permit certain 
permutations depending upon the grade, as will be shown. 

13. We shall now prove that the definition just given for A, , when the number 
of vectors exceeds n, leads to an expansion in alternants of graden. Thatis, 
we shall prove the theorem: 


XK An-vi On (a=1,2,+-+,2). 


As a matter of fact again the signs follow the determinant rule, provided we 
place the subscripts after Ao in natural order, that is, have j; S je S +++ S jai. 


For let us suppose this formula holds for all values of n up to and including 
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n = t, and for all values of 7, such that 2i =¢. Then it follows that 


+ 


whenever ¢ is odd, and further when ¢ is odd, so that ¢ + 1 is even, from the 
definition in § 12, 


= Ay a1 Ag: a3 
Now by definition whether ¢ is odd or even 


= A 141-9; * A 42-9; + Au 1—2j ° A 


where 27 =t+ 1. But each term on the right contains a case that comes 
under the hypothesis since there are only ¢ vectors concerned in the second 4. 


Hence we may reduce the right-hand member, and 


A t41~-9; A 0-9; Ag: Ok, °° * Ake, (— 


where ky < ke < +++ < koj-o, hy < ho < +++ < hy;. 
The first term reduces to 
( 


A 41-9; Ao: ay Ao °°* 
koky-+-ko;—2 


where r is the number of subscripts k,, 2, ---, ky < ko in the term, and 
ey, +++, ko = 2,3, -++,t+ 1 in different terms. This reduces 


to 


0, 


in which the subscripts are now rearranged so that g < ky < ky < +++ < koje. 
Nok = 1. In these terms aq; is missing with 27 — 1 others, or 27 including a1. 
The second term reduces to 


where no fis 1, and hy < hy < +++ < ho;. 
In these terms a; is present, and 27 excluding a; are missing. But these 
two sets of terms together give exactly 


= > Ao Qk, eee Ak, 


( 


209 
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and hence the theorem holds for the next value of n = ¢ + 1 and for all values 
of j that are possible. Since, however, we know the theorem is true for n = 1, 
n = 2, n = 3, it holds for all cases of n an integer. The signs follow the 
determinant rule provided that under Ay ~- the letters are in their natural 
order, that is, as on the left. We see now from the definition of the Hamilton 


product of two vectors of any grades that 


+++» +44, or Ag: 
= Q1°°* t+ An-1 Q1°°* 


Now we know already that 


OQn—-1 An = Ao * An + Ap An—1 An 
and 


An—2 ( Qn—-1 Qn ) = Ag * An—2 An—1 An + Ay * An—2 An—1 An 


It is evident from the theorem just proved that we must have the general 
Hamilton product (in which the multiplier is set at the left) given by the 
theorem 


ok 


If the A’s are inserted in front of this product they are understood as selective 
symbols that pick out the term on the right of corresponding grade. We have 
now to show that this product is associative, that is, we may group the vectors in 
the order in which they come, in any manner, and find that the Hamilton 
product of the groups is the Hamilton product of the whole set. For instance 


[ar * (a2 * as) ]* (a * as) [(a2* a3) * as) ]=a1* (03* a5) 


14. Associativity of the Hamilton product. We shall prove this by showing 
that whether we multiply successively a, by a@n—1, then a@n-2, +++, by ai, or 
whether we form the product of a, by a@,—1, ++, by a;, and then the product of 
this by the product of a; multiplied successively by ai-1, ai-2, «++, by a1, 
we have the same result. That is, we are to prove that 

= A, - Qn + Ano An 


In the first place letn =i+ 7. Then 


Aiog a= ( = (14, leq) Ao * Qi, *** 
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where the subscripts written inside the parenthesis will indicate the missing 
vectors from --:,a;. Also we have 


‘ 
Aj-om O41 = ( — (jy jam) Ao * Oj, 
A -A i—2qg ° A; Aj—2m °° 


= ( A 54 A (24 lo 


x A j-2m (n> Jom ) Ao Qi, Aig, Ao 


> ( a= A j—2(m4 p+ q) Qe . “ere QAi+j 


Ao Qi; eee Ag Qj, a;,,, Ao 


Ay Gi. 1, © 


Let r = m + p + q; then we may condense the expression just written into 
(—)" (41 ji ** Jomtp) Ao Ap ai, 

If now we set g + m = h, p = r — h, and holding h fixed let ¢ and m take all 
possible values, we have from the above 
h=r l=h, m=0 


h=0 1=0, m=h 

herd 
where we must haves + ¢ = 2randr But finally summing 
as to h, we have exactly the sum 

which is the expansion of the Hamiltonian product of a1, a2, «++, a+; in the 
order written. On the left the final sums give the Hamiltonian products of 
Q1, M2, ***, @ and 

We see now that the special use of a multiplication sign is unnecessary and 
that in all the functions we have built up, with the use of the A and subscripts, 
the array of vectors may be considered to be a Hamilton product, and the A a 
selective sign for the vector of the indicated grade. The remarkable simplicity 
of this result enables us to dispense with any very large collection of formulas 
for transforming expressions in this general vector calculus. We may break up 
by associativity any expression under an A and arrive at various reductions in 
this way. As instances 

= Ag+ + Ao + Ao) + a1 [A2 + Ao] 
Ay + + Ay Bi Bo Bs Bs) 
= Ay (a1 a2 — Ao: a2) Ay + Bi Bo B3 By = Ao + (a1 A, Bo Bs Bs); 


Ao * Ae A3 Ay = Ao * Ae Ao * G3 


Ag * Ao * G2 4 + Ag * Ao * 3. 


Trans. Am. Math. Soc. 15. 
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Expansions of this sort are unlimited in number.* Other useful formulas are 
easily derived by combinations, as 


* An An+1 Br =(- pAo ym 
* An A, By Bn > ‘Ao An °°* & An pBx Bn; 


2 * Qe Ay Ae Ao Ay + Ay a3 As = 


15. There are other ways by which we might have developed the subject. 
We might have taken a method similar to that of Clifford and defined the 
symbols a, 8, ete., which we have called vectors, as the hypernumbers defined 
by the ratios of geometric vectors; but this method (and others) has great 
disadvantages in the end. We might of course have defined a general or 
indeterminate product and derived from this expressions which would have 
been our A expressions. Some of these will be exhibited later. We should 
have difficulties also this way. From a long consideration of the problem the 
method of development used was settled on as the most desirable from a good 
many points of view. 

16. We proceed to develop a few theorems useful in applications. If we 
consider two complexes of order n each, defined by ay, and +--+, Bn, 
and a vector p, the part of p in the incident complex of the two (say p’) is 
given by 


= Ay pAna)A,: B. 


For it is easy to see that 


and therefore the right side gives >>; Ao - Ag: - aA, - B, when 
we set for the complement of a; 


But the form we now have is that part of p which lies in the complex of a', a’, 
+++, a”, that is, in the incident complex. It follows that 

Ao fie aA, B 


» *Conversely every expression A,»,- a +++ a, may be expressed as a sum of products alone 
without any A’s. See page 220. 


Ay: 

p A, A, 
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is the part of p (its projection) exterior to the incident space. It may be also 
written in the form 


A; [pAo + An @An (Ana pA, - a) An 8] 
Ag An aA, 8 


17. Another useful form is the following. Since A; + + 81 +++ An + @ 
is linear in a, +++, Q,, we see that if we consider a set of vectors defining a 
complex of order r, a1, Qn, ***, then 


identically. This is true whatever the vectors 8. Hence for any vectors 
ths s 


Ba~1 An ) A, Gy | 0 


identically. We may now change the order and have still identically 


We may however drop the first A,,; since Ag in front prevents any additional 
terms from entering, and then for the same reason we may insert A, as shown, 
arriving at a formula from which many useful formulas may be deduced: * 


Ao A, a, A; [Ay (An-1 By 
18. A similar formula may be deduced from 
which is linear in A, - aj, +++ a;,. If then we consider the set a1, +++, @ziyte 
where z may be zero, we have the identity 
Hence whatever the vectors 71, +++, Yez+y+2, We have identically 
whence for all §’s and y’s we have 
(Ay By Asty ery) °°* = 0. 


Particular cases occur for z = 0. 


* Volterra, Rice Institute Pamphlets, vol. 4 (1917), No. 1, p. 58 (8). 
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19. In the expression 


= Dt Ao: ai, +++ Bry 


the vectors a that enter the right-hand side are all different, and there are not 
more than r, hence we have 2r —-x —~ySrorr=a2+y. Further if the j 
set and the 7 set do not together make up the whole r set, then there are inevi- 
tably duplicates in the last factor which would cause it to vanish. Hence the z 
set and the j set must cover all the r vectors a1, --- , a,, though they may over- 
lap, in which case in the last factor the overlap would be missing. In each A 
we may reduce with no alteration in value by substituting an orthogonal set 
for the vectors appearing there (indicated by accents). The factor Ag would 
then vanish if it contained any vectors 8. Hence the vectors of the 2 set and 
those of the 7 set must define the same complex. ‘They must therefore each 
be the whole set a1, ---, a. Hence for any value except « = y = r the 
expression vanishes, and when x = y = r it reduces to 


Bi Bi. 

20. If we consider the bordered determinant 
Ap: ab Ag Ao aBe 

Ag: 716 Ao+ 


Ao- Yn Be 


we see that it may be condensed into 
If we expand by Cauchy’s theorem the determinant is equal to 

where A is the determinant of 8 and y alone, that is, the one bordered, and A;; 
is the minor of A produced by omitting the column containing 6; and row con- 
taining y;. Now interchange a@ and 6 and subtract. The first terms cancel 
and the others become 
Ag ° Ay abAs B; Ai; = > Ao Ao 


This is easily reduced to 


—— 
Ao 71 Bn 
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Hence we have the useful theorem 
21. We notice that 
An By — Bn* a Ans (A, By Bn* a) + An-1 (A, 


Hence we have since Ansys @Any1 @() = 0, and + a() = O, 


a* A, Bata 
= (- [An ° apy coe Bn - aA, By cee Bn] 
= (—)"[Ao Bi +++ Bh — An aBe 
Bah + 2A0 + An + afi Brn] 


It is apparent then that if we multiply an alternant of grade n by the same 
linear vector on the right and left, we have a vector of grade n, which is the 
part of grade n of the product of the original product by this vector on the 
right and the left. 

Since we have Ao af; Bn a = Ao + «++ B, and since 

aa = Ag: aa, and a: Bi By @ = Bi Bn, 
it is evident that for any expression 
(m =n) 

we have 

a* Am Bi Bata = Am * Baa. 
A vector of grade 1 which comes at both extremes of any vector of any grade of 
a product may be set outside the vector symbol of that grade. This enables us 
to make reductions. For instance 

= BY; 

aByaé = aaAyg + By 6+ By: ad. 
This theorem is useful in studying invariancy under rotation about linear 
vectors. The papers of Joly and M’Aulay should be consulted. 

It is evident that continued application of the theorem is possible. For 
instance 

22. Since the formulas we are to consider in this section are homogeneous in 
all vectors on both sides, we will consider that the vectors are wnit vectors, that 
is to say for each vector we have 


= Ay = 1. 
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We may introduce multipliers easily to produce vectors not unit vectors. 
We will occasionally indicate the opposite of a vector by a stroke over the 
symbol, thus 

—-a=a. 
We have then aa = —1. We shall use the stroke extended to cover more 
than one vector to indicate that the order is to be reversed and all the signs, 
thus 


This is called, in quaternions, taking the conjugate. 

This operation will not affect the form Apo + a; +++ a, for this vanishes 
when 7 is odd, and if n = 2m we have by definition of Ag + a; +++ aem 

Ag+ Q1°** Qom = (—)? Ao aj, aj, Ao Qi, Aj, Ao * Qi, 
where the subscripts are so arranged that 7, < j. (x = 1,2, +--+, m) and pis 
the number of inversions in the total collection of subscripts so arranged. It 
simplifies the count a little to arrange either the subscripts 7 or 7 in their 
natural order. It is evident now that as each factor may have its vectors 
interchanged, and as a change of both signs does not affect the value, we will 
have in any case 
Ag+ Q1 Gn = Ap? ar 

Hence if we remember the expansion of the form A,» + a, «++ a,, we see that 

as every term is an alternant of grade m, and as 


we shall have 
A, = ( ome ) dm (m+1) °Q1°** An. 


Hence we have the four forms for any product 7 = a, «++ a», according to 


form of n, 


Agm = Asm 7; — = Agmyi 73 
It follows that 
= (Ag+ — Ay — + + — — 


Hence 

+m) = + 

= if is of even grade, 


-)a if + is of even grade, 


if is of odd grade, 


+7) = + 
h(a = (Ag+ + 47° 


-)r if + is of odd grade. 
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We may also state the theorem thus: * in Ag, Ay, «++ we may reverse the order 

of the product, in A;, As, --- we may reverse the order, in de, Ag, «++ and in 

A3, Az, +--+ we must change the sign if we reverse the order. We may also 

state it thus: the substitution (1,”)(2,n —1)(3,n — 2) --- leaves invariant 

Ao, Ag, +++, A1, As, «++ and changes the sign of As, As, ---, Az, Az, 
Again we notice that if u is a vector of grade m, then 


= Amy1* wa + pa, an = Amy: + Ani: ap, 


since all other grades vanish, and by the formula in § 11 we have when we 


interchange the vectors 


m 


wa — pal, 
™[ Amir ap — ap). 


an = (—) 

pa = (—) 

This gives us at once 
pa+(—)" ap 
pa —(—)" 


2Am—-1° wa = (—)™ 2An 1 > ap. 


We have from this, reverting to 7 = a; «++ a, and writing 7, = A, - 7: for 
brevity, 

We also have 


ar = ar, = [Asi1 ams + amg]. 


It is clearer if we write these out in full, retaining only the terms actually 


present, 
am = Ag+ am; + + 72) 

+ Ap: a(m +73) + As + a(me +74) 
ma = Ao: am; + A1 a(m — 7) 


— Ag+ a(m — 73) + — + eee, 
Of course when 7z is even all the even terms are zero in these, and when 7 is 
odd all the odd terms are zero. We have from these, by combining, 


[am + ma] = Ag: am + At ato + Ao + + AZ + ame + 

[am — ma] = Ay ate + Ap am, + amy + Ag+ + 
We have a number of interesting results from these. For instance, taking Ao 
of both sides we find that 


Ag: 7a = Ao: ar. 


Hence Ao is invariant under the substitution (1, 2, ---,). This substitu- 
tion taken with the other which leaves Ao invariant gives a group of order 2n. 


*Since Ao(aj— 7) = 0, Aa- (a — 7) = 0, ete. 
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Since in the expanded form, where n = 2m, we see that there are 2” (m!) 
substitutions which leave the individual terms unchanged, and there are 
(2m)! substitutions altogether, there will be in the development of 
Ao & 


(2m)! 


= (2m —1)(2m terms. 
2” (m!) 


There will evidently be for n = 2m vectors of grade 1, $(n — 1)! different 
expressions Ao a1 Qn. 
Again we have 


Ay + Twa = Ay aw — 241+ ate = — at + 2am; 
1 
Ao + ma = Ag+ aw — am, = — Ag+ aw + aT; 


A3 = Ag = 2A3 = Az + 2A3 * 


Am + 7a = aw — ATma1 = — Ams aw + 2Am 
where the upper signs are used if m is odd, the lower if m is even. Returning 
to our previous procedure we have 
pag = - ap + ap = pAo ap Am-2 ° ag 

+ Am - a8 + + of, 
pAg : ap + (Ao ap ) 
— An + a8) Amis (M2 a8) 
aBu=pAg a8 + Amo a8) Am: (Ae a8) (Ao + a8) 
We have at once 
pap = — 2A», aB) 


whence easily 


Trap = abr — 2>> Lm (Ale a8) tm. 


We need not proceed in this fashion, for it is simpler to make use of the con- 
jugates, thus: let 


then 


and since 


we have the general formula 


at attr = Am: (re (—) Fz). 


Now by expanding each expression in the form 


T= An and T= Bp; 
TT = Am TH, 
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and similarly for 7, and cancelling all terms that must give 0, we arrive at any 
of these forms directly. The terms retained are of the form 7, 7, where 
m=|p-—s\, pts=m. 
If we take conjugates of the expression just written we have 


By combining these formulas by addition and subtraction we arrive at a 
variety of forms. 
Applying the same method to three products as factors of a single product 
we see that 
We combine this with orr = SA, - orr to produce various formulas. In 
particular let and 7 be vectors of grade 1; thena = — = — 7,so that 


tro = >, (—)immt) - 

We have then 

ort t+ tro = >. An F] 
Remembering the grades of ¢ and 7, the only possible grades for the [] which 
will not give zero terms are the grades m — 2, m,m-+2. The bracket there- 
fore reduces to 27,, for the upper sign and 2 (am~2 + 7m+2) for the lower sign. 
Hence 

ont + tro = An 

ont — = Am + [O%m—2 T + OTm42 T). 


We finally reduce these as follows: 


+ = 2); Am O[Ami1 * + Am-1 Tn T) 
= > (—)*" A, o[ Amis THm — Am-1 * 
= + Ag+: oT — Am: GAm-1 
— Am + TAm-1 OTm), 


ont — Tro = + Am (Ao or) (tm—2 + 


These formulas give the effect of transposing two vectors in such a product. 
They are very useful For instance when we introduce Y¥ it will be seen that 
we may set V fora, and for VAo - 7() + 7Ao - V() we may write 2¢9(). If 
then we set 7}, for the value of z when in the vector of grade m in each term of 
the expansion for tm (say Am + Y1°** Ym) We write ¢o, thus: 


Am $0 (91) ¥2 Ym + Am* ¥1 G0 (Y2) Ym 
+---+A/,,- go (Ym); 


we have 


Var + = + Vr F2> zh, 
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where V acts only on rt. The expression Ao - Yr will be called later the 
divergence of tr. Likewise As - Yr will be called the curl of r. We see then 
that when curl r = 0, Yar = trV. We cannot elaborate on these, but this 
single example will show how simple the transformations are without any 
great display of formulas of reduction.* 

A few cases need to be written down, however. They will explain them- 
selves. We use 7 and 7 as products of grades 7 and j respectively, and o as of 
grade 1: 


ar. — 
) 


= 


+ (—)'oAin; 75); 
+ 
23. We consider now more particularly Ag - a; +--+ ae,. In the first place 
from the development it is obvious that the form is a pfaftian. Hence if we 
square it we shall have a skew determinant of even order, that is, 
(Ag + Gen)? 
Ag+ a3 +++ Ag+ Con 
It is evident in this form that advancing each subscript cyclically is equivalent 
to moving the first row and column to the last row and column, which does not 
change the value of the determinant. Reversing all rows and columns does 
not change the determinant. The number of positive terms in the pfaffian is 
one greater than the number of negative terms, hence not all terms can have 
their signs changed without changing the value. This shows again the permu- 
tations possible in this form. If any vector is orthogonal to all the others the 
pfaffian vanishes. From the form orr + rzo if m = 0 we have 


Ag: omr + Ag+ tro = TAQ: oT. 


Hence if ¢ and 7 are orthogonal the right side vanishes. Hence interchanging 
two orthogonal vectors in the Ao - form merely changes its sign. 
Let Ao- () represent Ag+ a1 +++ Gn, (th, 2, ***, represent 


Ag+ Gon, m<n, and let (—)**Ay- be the minor of 
tg, fom 
A): a;a. It is clear that if we differentiate the expressions Ao - () as to 
i 
*By means of these forms we can reduce every expression Am-+a,-+-a, to sums and differ- 
ences of products alone, thus Ao-aya2 = }(aya2 + aza;), = } (aya, — ayara; 


(m7; + 7; = 
(—- 
| 


1922 ] GENERAL VECTOR CALCULUS 


Ao + a; a partially, we have (—)**" A,- (i, k). If we differentiate the 
determinant in the same manner we have 2( — )*** A;,. Hence 
Aix = — Ag+ (1, k). 
From a well known theorem in determinants if 

is the determinant produced by erasing from the original determinant the rows 
indicated by the 7 subscripts, and the columns indicated by the j subscripts, 
then we have 

In case the subscripts 7 and j are the same, A is a skew symmetric determinant 
also and is the square of Ao - (71, +++, %m). The determinant on the left in 
any case reduces to 


0) (14 is ) Ag: (71 Um ) 
— Ao Ag (%2 im ) 


(—)™[do- 


This vanishes if m is odd, giving a theorem as to A, and if m is even the de- 
terminant is the square of a pfaffian, giving a series of theorems about expres- 
sions built up from the forms Ao - (2, 7) in the same formal manner as the 
forms Ap are built up from Ag - a; a;. We cite only the following: 
Ay (a, b) Ag (ce, d) Ao ° (a, c) Ao ° (b, d) 
+ (a, d) Ag+ (b,€) = Ag+ (a,b, c,d) + (); 

Ao (a, b) Ag (ec, d, e,f) Ag (a, b, Cc, d, e,f)Ao 
They are easily written and extended to all orders. 

24. Space does not permit further developments here, but enough has been 
said to make it easy to continue. The particular forms that are related to the 
linear vector operator, whose general form would be 

= Lado - B(), 
furnish a complete part by themselves. When the number of vector terms 


becomes infinite we have functional transformations, and functions of lines 
with related subjects enter. This development is deferred to later papers. 


5. THE DIFFERENTIATOR 


25. We should however mention here the differentiator (corresponding to 
the Hamiltonian nabla) related to a vector p = }-2; a;, expressed as depending 
upon parameters, 7 = 1,2, +++, 


V = > a; 0/d2;. 


221 


J. B. SHAW [ October 


This symbol may be used anywhere just as if it were a vector, for all vector properties, 
and is supposed to differentiate in situ any vector dependent upon the parameters 
entering p. It differentiates also in situ any function or operator dependent upon 
these parameters. ‘This one simple rule enables us to write out at once an 
indefinitely great number of forms for differential work. No special collection 
of formulas is necessary. The mere fact that VY may follow an operand is a 
principle that introduces great simplicity. We observe here that in the case 
of functions the process of differentiation of a function of a line used by 
Volterra is a case of the use of a Y in which the number of parameters is non- 
denumerably infinite. 

It is evident that we have 

d+ () = () 
for any operand dependent upon the parameters ofp. (Note the parameters of 
p belong to the complex in which p is variable, and do not mean the coérdinates 
of p, which would be usually more numerous and might be infinite in number.) 
A simple example of the use of Y is shown in the following. 

If we have a pfaffian differential form w = SCy; dx; we know that it may be 
reduced to a set of terms udv + pdq + rds +--+. In case there is but one 
term and that is dv, the expression is called integrable or exact. If the single 
term is udv it is said to be integrable with a multiplier, w. The number of 
such terms was shown by Pfaff to be not more than half the number of param- 
eters x, if this is even, and not more than half the number plus one, if the 
number is odd. 

Suppose we can reduce w to udvy. Then we have by setting ¢ = uv 

Vo = Vuvr + where = VV, 
whence 

Ag + = Ac’ Vo, and A3:oVo = 0. 
This is a necessary condition of integrability with a multiplier. In case u = 1, 
we have as the necessary condition A, - Yo = 0. Considering likewise the 
next form 
>= udv + ds = Ag+ dp(uYot+ Vs), 
we set 
c=uytys, 
whence 
Vo = + 
2. A3+oVo = VsVuve, VoVo = 0. 


This problem has been studied by Cartan* who gives the conditions above in 


*Annales de 1’ Ecole Normale (3), vol. 18 (1901), pp. 241-311. 
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scalar forms which, owing to the arbitrary differentials, are equivalent to 
these: 
Ao Ax Vode pizp = 0, 
Ao A3 + oVoAs 5; pb3 = ete. 

The expression Ag - Vo is called the divergence of ¢, and Az - Va is called 
the curl of ¢. 

We have other differential expressions, as for instance in the ordinary 
relativity theory in four variables, we consider A; - Yoo called the curl of the 
2-vector (vector of grade 2) o2. The complement or adjunct of this in the 
region of order 4 is called the Lorentzian of g. It is A; + A3 Yoo Ay ws. 

We may build up expressions with Y which are covariant for unique and 
invertible transformations @(p). Such are 

Ag+ Ao A3 di pds pd3p, 
In fact the possibility of expressing a function in such wise implies certain 
covariant properties. 

26. In general if we write the form Ao - dpa, and consider the equation 

Ay: dpc = 0, with Aony1 Yo = identically, 

we undertake to find v such that Ao, - YeoVo --- Yo = 0, where both forms 
A are alternants. From the latter, multiplying by dp and taking Ao,_; + we 
have 
Aon—1 dpAon + VroVo +++ Vo = 0 = dvAnn_ +++ Vo 

— Ao + dpgAm—1 +++ Vo 

+ --- — Ag YroVo + 

= dvAon oVo — (n — 1) Ag+ 

+ (nm — 1) YodoVo +++ + +++, 


Set dv = 0, so that one solution is v = const., and the remainder of the ex- 


pression furnishes an equation which must hold for arbitrary differentials of 
the parameters of p. Setting their coefficients equal to 0, we are able to find v. 


For instance let us have 

V3 die Ve + (ay v3 as) dis v3 dis = (), 
We have o = 21 + 2% + (21 + ay + a5, the vectors 
a, ***, @5 being constant. It is easy to show that A; - eYoVo = 0 and four 
variables will suffice. ‘To find v so as to reduce the expression, we notice that 
since the form will be udv + rds 

= + rvs, As Vo = Ay Vuvv Ao vrvs, 

A3 + = rA3+ VsVuvo + Vrvrvs. 


Either Vv or Vs is common to both, hence 4, - YooYo = 0. We assume the 
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form for v such that 

Vo = V2 Qe + U3 + U4 + 
and substituting all values we have a vector of grade 4 which vanishes so that 
each coefficient of the alternants A, - a; a2 a@3a4, Ag+ 0305, etC., 
vanishes. These are here 


V3 — Xe Ag Xp Vo + Ay Y = 
— Xo as V5 are Vy — Vg + (273 4 — — 
From these we have 
— Ao Vo + 2373 = O, 
These are satisfied by taking either = 21/23, Or a2 + a4, OF a%4 25. 

The equation Ay - dpa = 0 means that dp is everywhere orthogonal to the 
congruence given by sle dpa = 0. This is called the characteristic congruence. 
If the equation is integrable with or without a multiplier, the congruence 
satisfies Ao + Yo = Oor As - oVo = 0, and this makes it a normal congruence. 
That is, there is a function of n — 1 of the parameters, f, such that Ag aVf = 0. 
The integral in fact is f = const. The other cases, Ay - Yo Vo = 0, ete., give 
special congruences worthy of study. 

27. The integral {Ay - odp depends generally upon the path of p, that is, 
p is taken as an arbitrary function of a single parameter, s, p = p(s), and dp 
becomes p’ ds. The terminal values of s furnish the limits. The integral 
evidently depends in general upon the form of the function p(s). If the 
initial and the final value of s are the same we say we have integrated around a 


loop, and indicate the fact with the sign g. If we follow any path for which 


we have Ag - odp = 0, it is evident that the integral from any point to any 
other is zero, and such paths are orthogonal lines for the characteristic con- 
gruence. In case they can be collected upon spreads (functions of one or more 
parameters) these functions are called solutions of the pfaffian form. 

If we follow a loop or closed circuit in the integration, and choose a spread 
of order 2 (a surface), attending as usual to singularities, we may introduce 
elementary areas and circuits, arriving at the generalized Stokes’ Theorem, 

+ adp = Ar dy pdop, 
over the area. This theorem still applies in case the line-integral is not taken 
over a circuit, if we can draw through the terminals of the actual path orthog- 
onal lines as mentioned above so as to complete a circuit, since the integral 
along the lines introduced is zero. If the curl of ¢ is zero at all points of the 
area the loop integral evidently vanishes. 
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6. INTEGRAL INVARIANTS* 


28. Suppose that we have a general function linear in dp, Q(dp). If we 
integrate this over a loop the form of Stokes’ theorem is 
Q(dp ) SSY (A, dy pds p), 
where the accent shows the operand of the accented VY. This may be con- 
sidered to be the difference between two distinct paths from an initial point to 
a terminal point chosen on the loop, the double integral being taken over the 
enclosed area. If the paths differ infinitesimally it becomes the variation of 
the integral for a variation in path. Therefore if 
( (A, = identically, 
the variation is zero, and the integral is called an integral invariant for the 
variation in path. We do not necessarily have to retain the same end points, 
for we may move either along any line which satisfies the equation Q (dp) = 0. 
When the integral is invariant it is called the integral of the exact differential 
form Q(dp) = 0. For instance, A» - adp is exact, the integral being Ag - ap. 
a (Ay VAo[]) = 0, since is constant. 


In this case we have identically A,» 
The lines which satisfy A, - adp = 0 are p = sa. The path could be moved 
parallel to a, leaving the integral invariant. 

29. It happens sometimes that an integral remains invariant when the 
whole loop is displaced, as for instance the strength of a vortex tube. The 
general condition is obtained thus: let the integral be taken over a region of 
order n (that is, there are m independent parameters and these are varied 
between appropriate limits), 

I, = Ff Q (A, d pds p d,p), 
where /§” means n integral signs or an »-fold integral (the iteration of an 
integral in order to procure an area-integral, or a space-integral, is incidental 
here and may not be necessary). Suppose now that we displace the region so 
that p becomes p + 6p; then we may consider the effect of this on any function 
of p by noting that 


6 = 
Hence 
= Ao: 6pVS" Q( A, ip +++ dnp) 
= J™ Ag 5p’ (An +++) +S" Q(An dh bp dz p +++ dnp) 
+S" Q(An: dip da dp) 
S™ Ao (An dip +++ dip) 
+ (An dp!Ana +++ dep). 


*The contents of this division were included in a paper presented before the Society, 
Chicago Section, January 2, 1913, Integral invariants in general vector analysis. 
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This is called the first form. Integrate the last term by parts, combine, and 
we have 


= SJ" Q’ (An dip +++ dnp) 
+ S*[Q(An 5p) + dip +++ dip. 
This is called the second form. The expansion of the last term gives a number 
of terms each of which is integrable once, and we have 
= Q’ (An + + bp dip -++ dy p) 
+ 1(QAn (Qn Ant dip +++ dup. 
Let dp follow a path so that dp = odt; then we have 
= Q’An Angi odip +--+ dyp 
If we had retained the first form we would have 
If now J, is an absolute invariant, we must have for the paths given by o 

or for the second form 

Q’ (An +++) + (QAn + + V’An +++ = identically. 
These conditions are very general and include every form as usually stated. 
As an instance, consider the integral f-4 9 - rdp. This will be an absolute 
invariant for the congruence a if 

Ag 7() + Ag: () = O. 
The condition may also be written 
= or or) = — Ay- Yr. 

In case the boundary is fixed, then at the boundary 6p = 0; if the integral 
is to remain invariant for all ¢ , we see from the second form that as the second 
integral is now zero, and the first holds for all o, we must have 

Q’ (An +++) = identically. 
For instance, in the example just considered we would have 
Ag+ 7'A, +++ = identically, 


whence 
Vr =O identically. 


The situation considered here is the extension from an integral over two 

distinct paths, with fixed terminals, to an integral over two regions of order n, 

with fixed boundary. Such an integral is indicated by Goursat* by J?. When 
*Journal des Mathématiques (6), vol. 4 (1908), pp. 331. 
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the boundary is closed we have simply an invariant, called a relative invariant, 
indicated by J,. 

30. In a manner similar to that for Stokes’ generalized theorem we have 
Green’s generalized theorem 


Tn = = dip dip) 


The second integral is evidently of the type J¢. Since now a relative invariant 
integral may be written as an integral of order higher by unity, the condition 
of its being invariant may be stated. That is, 


Jn = dap) 
is a relative invariant if 
= SQ! (An V’Angi dip p) 
is absolute, that is to say, if 
Ag+ Q’ (An + (An V’Angi V" Anyi) = 0; 
or if 
Q’ (An V’Anti V’Ante + (Q’An V’Anti V" Ant = 0. 
This gives us the condition (since A, A» V’ V’Anye vanishes identically) 
For instance in the case considered above 
Hence, reducing, Az - VA1 - Yr = O identically. 
31. Since we may always write 
Ag: (Q’An + QAn CAn-1 An-2 V'An-1 
= Ag: oV’ Q(An* Ani V'Ane), 
the condition above becomes 
Ag (An ()) + Q(An V'An GAn-1) = 0. 
Hence if J, = §” QA, is an absolute invariant, 
Tra = S™*Q(An + 
is an absolute invariant. 
32. From any integrand Q(A, ---) may be formed by two processes two 


other integrands, one by the process D, giving Q’ (An - V’An4i +++), and the 
other by the process E, giving Q( A, + @An-1 +--+). In case the latter form 


Trans. Am. Math. Soc. 16. 
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vanishes identically, we mark the integral J, of Q (A, ---) with an e, thus J%. 
In any case the second application of the EF process gives an identically vanish- 
ing form, hence from any /, the E process produces an J{_,. Integrals are of 
four types with respect to these processes, that is, according to whether or 
not they can be marked with d ore. The effect of the D process on the inte- 
grand is to mark the integral with d, of the E process to mark the integral 
with e. We need to prove that if an integral invariant has either character 
the other process will not destroy this character. That is, 
= 

E produces from I‘ an J“, , that is, if the first is from an exact integrand, the 
second will also be exact when the first is invariant, and also will be invariant. 
For from 

(A, +++) =0 
and 


(A, Ani es -) + (QA, V’An 0 


it follows that 
(QA, a)’ Vv’ 0, 
which is the condition that the result have an exact integrand. To be in- 
variant we must also have 
which is evident. Again when QA, -o--- = 0 and J, is invariant we have 


also 


showing at once that Q’4,- VY’ --- is of type E. That it is also invariant 
demands that 

Anti V’ Ante + (Q’A, V’ Anti = 0, 
which is evident. Hence the two operations are permutable when applied to 
an invariant, one giving Q’ Ay V’Anyi GAn «++, the other 


(QA, 


If Q is invariant, however, the first expression is the negative of the other. 
The characters are then unchanged. If we start with an invariant and apply 
the E process we arrive at an J* of one lower order, usually not identically 
zero, and if we then apply the D process we have an integral of character [** 
of the original order. For instance, from Ag « tdp we have, by D, 


— Ag+ Az di pdep, 
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by E, Agro. Using the other process in either case we have from the first 
— odp, from the second (Ag - dp. In case now 
Ao - tdp is an invariant the new forms are also invariant. 

The D process is a sort of generalization of getting the curl, while the E 


process fixes certain paths for integration, along which the lower integral may 
be displaced. 
As instances, let J; = f-Ao « tdp, and let J have both characters d, e; that 


means 


and for a certain congruence given by dp = edt, 
Ag -ta = 0. 
The expression is exact on account of character d; hence the integral is 
u = const. 
This is a function of p, and such that 7 = zyYu. Hence we have also Vu 
everywhere orthogonal to the congruence line ¢, by the second condition. 
That is, o gives lines lying in the spread u = c. 
Let 


Ie S'S he dip dep; 
then 


Ao Ay * Ay VA3 = 0, Ao Ay * Ao oA, «= identically. 
We have easily - Yro = 0, Ao Az Ap od = O for every that is, 
Ay = 0 identically. 


From the first condition 


= Ae VE; 
hence 
Ay ° VE = (), 


Put dp for cdt, and we have along the congruence 
A, + dpAz VE = O,7 or dé = V'Ag dp. 
This means, if we set ¢ for the linear vector function VA - £(), 
dé = gdp. 


We have then formally dp = ¢' dé. The complete discussion is long. 

A comparison of the methods used here and those given in the coédrdinate 
treatments of these problems will show the enormous gain in simplicity. 

33. To solve the differential equation 


Q An + = 0 


Vr=0, 
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means that we must find a congruence of curves such that if dp is taken along 
any curve the equation will be satisfied, and that x(p)Q is an absolute in- 
variant for the curves. That is, setting dp = edt, 


Ansar = (), 
(2Q)'A, VAnis + (aQA, V’A, =0 


= + Ao: cV log x QAn. 
Hence we must have 


for some ¢, and ¢ may be zero. For instance let Q be Ay - dp = 0. The 
condition is then 


Ay: o& = 0, Ao Ap VEA2 () = tho: E(). 
For a particular case let £ = A, - pA2 + ap, where a is constant. Then 
Ay + VE = ap, Ac + opAs + ap = 0, + + ap = tA; ap. 
These are satisfied if we take 
o = tp + any vector orthogonal to a and top. 


Hence dp must be in the plane through the origin containing the common 
perpendicular to a and p. Hence dp is in any plane tangent to a cone of 
revolution about a as an axis. This cone must therefore furnish an integral, 
that is, 


Ao: pp = (Ao ap). 
34. A system of equations 
0 = dp = Ao &: = En dpp 
is equivalent to a single equation 
An-1 + dpAn + £1 +++ = An-itnadp = 0. 
The conditions then reduce to 
= 0, Ag GVAn-1 () tn — Ao* () VAn-1 = tAn-1 () th. 


We cannot enlarge on the use of these forms in the study of differential 
equations, but enough has been said to indicate the procedures. We will, 
however, show the application to differential geometry. 


II. DIFFERENTIAL GEOMETRY 


1. Let p be a vector dependent upon n parameters 1%, U2, -**, Un, thus 
defining a region of order n. For instance if n = 1 we would call the region 
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a curve, if nm = 2, a surface, etc. We have then 
p(u, U2, Un). 


2. If now we let the parameters vary and assume that the function p is 
differentiable as to each parameter, indicating the derivative by a correspond- 
ing subscript, we have 


dp = + po duz + + pn dun 


where p; = 0p/du;, and the n vectors are in general linearly independent. 
We will indicate the adjunct of p; in the region by p* so that 


= ( A, y P1 P2 °°* Pi—-1 Pi+1 °°° Pn A, * P1 P2 

eee Pn/ Ao On An pi eee Pn- 
For convenience we shall write vy = A, p: --+ p, so that v is an alternate vector 
of grade n, which is also a function of the point at which it is calculated. We 
also set (I - v)* = Ao- w, and Uv = r/Iv is a unit vector. Where vy = 0, we 
have singular points. 

It is clear that 
Ao: pip’? = 0, if and Ao: pip'=1. 

Further 


A, + +++ = v/Aow, 


which is seen by expanding each in terms of p1, p2, «++, pn and applying 
the theorems of determinants. 

3. The expressions Apo - p;p; for7, 7 = 1, ---, n are called the fundamental 
quantities of the first order. They are commonly represented by a;;. The 
corresponding forms Ao - p‘ p’? = a” are the adjunct quantities of first order. 

It is clear that 


4, The biorthogonal systems p; and p‘ are useful. Suppose for instance we 
have a system of values /, which we attach to p, giving a vector 
A= Ps ls; 
then we may define a new system of values 
m = Ao: 
and from the properties of the vectors p we have 
m,p, = pr =X. 


The coefficients m and the coefficients / therefore give the same vector when 
attached to the proper vectors. For the vector p in terms of p;, and the vector 


231 


232 J. B. SHAW [ October 


p in terms of p* we may introduce a new system of n independent vectors defin- 
ing the region, and their biorthogonal system. As we are dealing with the 
vectors themselves we shall not be concerned with their mode of expression. 
Such questions however play an important part in the developments of Ricci 
(Lezioni sulla Teoria delle Superficie) and those who follow him. 

5. If we have a vector £ = >-X, 9", and in the other form, § = /\X"p,, 
then £/Ip, is such that X,/Ip, = the orthogonal projection of — on p,. In 
other words the orthogonal projections are given by Ag Up, £. Similarly for 
the other form. 

6. If we differentiate Ao p, p* = 0 or 1 we have in either case 


Ao prt p® + Ao pr (p*): = 0, 
a useful formula. From it we have the Christoffel * form 


t = Ao prs p' = — Aopr (p')s = — Aops (p‘)r. 


Evidently 
> pt Ao * Pr (p‘)s + Ay UvAnss Uvprs 
pr Ao -ps(p')r + A: UvAn+1 Uvprs. 


The second termf in each form gives that part of p,, outside the region of order 
n. The other Christoffel form is 


rs 
| { |- Ao * Prs Pt- 


7. Since pipe = 0 we have 


or v(p™)2 = — Angi: 
and likewise 
Ans * = — Angi’ Vr Ps = — Anji * Vs Pr- 
There is usually a non-vanishing expression of first degree which is orthogonal 
to the region under consideration (pi, «++, pn) 
We note that Uv might as well have been used here as vy. We can construct 
n* expressions 
Ao: * Prs VAns1 VP tus 
which can also be written 
Ao Ansi* ¥tPu, OF Ag+ pr Ar OF Ao Anti * 


* This is easily reduced to the usual cartesian form by writing out p-;in terms of pi, -*-, pn- 
For the other Christoffel form expand p,, in terms of p’, «++, p”. 

+ It is to be noted that in general p,,; has components not linearly expressible in terms of 
p1, ***, Pn, a8 for instance for a surface, which is imbedded in a three-dimensional space. 
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and in other forms produced by permissible interchanges of the subscripts. 
However this is a quadratic in p,, p, of the form Ao p, dp. , where ¢ is a linear 
vector function. Its transverse would be 


Ao Ansa Vs Pu- 
We will set 
(Uv)s Ans: A (Uv), () = Ns 
A, (Uv)t (Uv), () Nt Nis. 


The difference of these two forms has received a symbol due to Riemann: 
(ru st) = Ag: Pr [( Ur)s An+1 ( Ur). 

( Uv): ( Uv )s] pu Ao * Pr 
There is also another Riemann symbol * of order four, 


{ru, st} = pr Nee — pr Nis = Ao pr 


where e(N) = 3(N — N),N being the transverse linear vector function of N . 
A better form may be given these by utilizing a previous formula,} which gives 


{ru, st} = Ao Az pr pu (Uv), (Ur): 
= — Ao: prs UvAo put Uv + Ao pre Uv Ao pus Uv; 
{ru, st} = Ao (Ur), (Ur), 
= — Ao: prs UvAo Uv + Ao prt UvAg: (p“), Ur. 


From these expressions it is evident that the two symbols in each couple may 
be interchanged by changing the sign, and the two couples may be 
interchanged. 

8. The expression Ao dpdp = Ao p; pz du, dus is called the first funda- 
mental form. In case the Riemann symbols vanish it can be reduced to the 
sum of not more than n squares, that is, new variables may be found in terms 
of which the parameters « may be expressed so that, for the new derived 
vectors p’, we will have 


Ao dpdp = (du’.)?. 


The condition is both necessary and sufficient. If some or all of the Riemann 
symbols do not vanish, then it will be necessary to express Ag - dpdp as the 
sum of not more than n(n +1) squares. From one point of view this is 
equivalent to considering the region to be embedded in a region of not more 
than 3n(n +1) dimensions in which we may take the directions of p; as 
orthogonal. 

For instance a curve on an ordinary surface has a normal in the surface and 


* These may be identified with the forms commonly used by substituting and reducing. 
t Ao Az prpuAz(Uv),( = + (Uv): Aopu( — Aopr( Uv), Ao pu( 
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also has the normal to the surface as a normal, or indeed all lines in the plane 
of these two. Hence the ds? is expressible in terms of three differentials in 
ordinary space which encloses the surface, in the form 


ds? = dz” + dy? + dz’. 
On the surface alone it would usually have terms of the form 


du*?, dudv, dv’, 


and would not be reducible to the sum of two squares. 

9. We may improve the forms we have reached so far by the introduction 
of the operator which generalizes the Hamilton VY. On account of this general- 
ization we shall use V in a general sense. It is defined thus: 


p' 0/du;. 
It follows that d() = Aodpy - () for any function of the parameters u only. 
Hence we will have 0/du,; = Ao - pr V -; therefore we may write for (Ur), 
(Uv), = Aop, V: Ur. 
The operator 
Ay()V-Uv=@ 


plays a very important part in all these problems and gives of course the 

vector rate of change of Uv in the direction given by the unit vector operated 

upon. With this symbol we may write the Riemann symbols in the forms 
(ru, st) = Ao Ae Pr Pu Ao Pps 

with the understanding that s, ¢ may be interchanged with r, wu; 


{ru, st} = Ao As pr p™ Az Pps 


The number of Riemann symbols with two pairs of identical subscripts is 
3n(n — 1), with one pair of similar indices } (n — 1) (n — 2), and with all 
different —1)(n —2)(n —3), giving a total of 
It is clear that for any four directions in the region we may have a symbol 
Ao Ao aBAy Py5, which for unit vectors may be called the Riemann symbol 
for those directions. There is also, if we take differentials for the intensities 
of the vectors, a differential quadrilinear expression 
G, = Ao: Ap: d' pd" p p Pd’ p. 


Since every symbol is invariant for a change of parameters, this is a covariant. 
We see at once that for any six directions we have a similar form 


= Ago: di pdzpdz3p 


and+there will be higher forms as high as G2,. These are all covariants, and 
are generalizations of the Christoffel quadrilinear covariant G;,. 
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10. From these we derive covariant quotients, which give us in the first 
instance the Riemann curvature for the plane of d; p, d2 p, and by generaliza- 
tions, Riemann curvatures for the regions of d;p, d2p, +--+. These are 


Ag+ Am +++ dm pAm Pd; p --+ Bdm p 
Ky, (di dy) = —— 
) Ag+ Am: d; p +++ dmpAm d; +--+ dnp 


When we arrive at the covariant of order K,, it is evidently the single coefficient 
of the Christoffel form for the same order. This number is called the 
Kronecker-Gaussian curvature of the space. 

11. If we construct the second differential of p , d? p, we have two parts to it, 
namely, that part which has components in the region, and the part which is 
orthogonal to the region. Since we have identically 

Anyi + Urdp = 0, 
it follows that 

Ansys dUvdp = — Any + p. 
But 
d?p = p; Ao: p + Ay» UvAny: Uvd? p 
= p; Ao: p — Ay UvAny1 dUrdp. 
We set now 
£ = Ay + UvAny, dUvdp. 

It is evident that Ap - p; = 0 fori = 1, ---, n, so that & is not in the 
region. £ is called the vector second differential form since the coefficients of 
its expansion are scalar second differential forms.* We evidently have 


The coefficients A; - UvA,,,; (Uv),ps are fundamental vectors of second 
order. They depend upon the parameters and are not covariants, but they 
furnish a number of covariants. We see that 


Anis = — Anyi + Uvé. 
Also 


Ay + dUvdp — Ay + UvAns Uvé. 
But in general 


A, + Ap On Anyi Be Bn 
hence 
Ay + UvAnsy; dUvdp = dpAg UrdUv + Ay + (An-1 dpA, + Uv) A, dUv 
A, + (An-14 + dpA, Uv) A, dUv, 


*Cf. Wilson and Moore, Proceedings of the American Academy of 
Arts and Sciences, vol. 52 (1916), pp. 270-368. 
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and also 
A, + Uvt = + + EUV) An + Uv. 


But & is orthogonal to all vectors p, hence A,-1 - Uv = 0. Hence 
= Ay ° dpUv) A, ° dUp. 


If we go back to the theory of curves we see that if we write the differential 
arc in the usual form dp = ds - a,* then d?p = d? sa + dsdaand as Ay ada = 0 
it follows that the part of d?p outside the region consists of part of the term 
containing da. If da = cBds, = Ay An + UvAns1 - UvB - ds*,, which is 
the part of 6 - c - ds? orthogonal to every vector p;, «++, pn- 

12. It is evident that the first differential quadratic and whatever is deriv- 
able from it of a covariantive or invariantive character will be true of a region 
which is flexible but inextensible. That is to say, bending the region would 
have no effect upon the first fundamental form nor upon the quantities derived 
from it without differentiation. However the second fundamental vector 
form does not possess this character and demands a rigid space. This is due 
to the fact that d*p contains a part which is not in the region itself. For 
instance if we are dealing with a curve in three or more dimensions, the first 
fundamental form is merely ds, and as long as we do not consider d? p we 
merely work in the line itself which for all such purposes might as well be 
straight. If however we consider d?p = d*?s - a+ ds -da where a is the 
unit tangent (or better the hypernumber belonging to that particular tangent), 
then we must either abandon the line or consider da = 0 which makes the 
line really straight. 

It is to be noticed that the expressions (Christoffel symbols) Ao - p;s pz: can 
be found from the region itself, since 


2Ao prs pt = (Ao pr pr)s + (Ao ps pt)r — (Ao pr 


all of which could be computed in a practical case from measurements inside 
the region. From these we can compute the expressions Ao -p;sp'. The 
Riemann symbols are also computable in the same manner, since 


(rt, su) = (Ao prs — (Ao Pru Pt)s 
+ [Ao p* (Ao Pru pn Ao Pst Pk — Ao Prs Pn Ao * Ptu 


Covariants built out of these forms can likewise be computed directly in the 
space, and would be independent of any particular directions such as r, s,t, u. 

13. If a = Udp, then da has two components, one in the region, and one 
exterior to the region. The latter is £/ds. The former is called the geodesic 
vector curvature of the curve of which a@ is the tangent. If it vanishes the 
cutve is called a geodesic for the region. This may also be found as a curve of 


* Where a is the unit tangent vector. 


bo 
Oo 
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minimal length. For let ; 
s= f VAo - dodp; 


= [ dpdp = f 40 dipdp/ds = dipa. 


Integrating by parts, the first part being then exact and vanishing for an 
extremal, we have as the condition of a minimum (or maximum) Ao dpda = 0 
for every dp in the space. That is to say, we have da wholly exterior to the 
space. We might also write this fact in the form 


A,-1daA, Uv = 0. 


then 


It is understood here of course that the da is for a displacement along the 
curve. 
14. The covariant G; can be written 


G; = Ag Ag dy pd. pAs ds Uvd, Up 


and is thus dependent upon the four directions of differentiation. Now we 
have 


dUv = d(v/Iv) = dv/Iv — UvAg Urdr/Iv. 
Hence as As An a1 Gn An *** Qn = 0,* and therefore A, - = 0, 


= Ao As d; p dz p[ Ae dz v dy v 
— A, + Uv(d3vAo+ dyv — dyv + Ag+ d3v) Uv]/Ao + vv. 


If we let four tangents to the space, that is, four unit directions, be 
Q1, Q2, a3, ay, We have 


K, = Ao Ao * Ae Ao az a4/Ao . As * Q1 Ae Ao 


The Riemann symbol (rs, tu) may be computed from observations in the 
space as follows: 

We find first 

(Ao Prt Psu — (Ao * Pru Ps)t- 

Then we find in the region Ao p’, p.. — Ao * pu ps: Which is subtracted from the 
other value ($12). The difference is (rs, tu). The accents on p indicate that 
we are measuring so much of p,;, etc., as we observe in the region in question. 
In the first expression these accents are not needed, as the form shows that the 
other parts of the second derivatives drop out anyhow. If we multiply by 
du,, and du; and sum for all values of the differentials we have 


d, Ao ds Pr Ps — dz Ag di py Ao: ds p, d, + Ao: dy pr ps. 


Ao An An a, = Ao a@1°** Qn An-1 An Ag aj aj, which will 
j 


contain (Az a; aj + Az a; a) Ao An-1 Gn *** and thus all terms cancel. 
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Multiply again by du, du, and sum again, and we have 


G, = d; Ao: dz pds p — d, Ag+ dy d; pd, p 
— Ag + dz dz p’ di dsp’ + Ao + di ds dy p’. 


This is the best form for this quadrilinear covariant, since by choosing the 
directions of the four differentials properly we may obtain any onefof the 
Riemann forms. 

15. The Riemann-Christoffel tensor Gf can be written in the form 


(Ao + + Yi V2 Ao ¥5)”, 


where Y; acts on y only, Ye on 6 only, a, 8 are purely arbitrary, the double 
accent means that only components exterior to the space are concerned. 
a,8, 7,6 are unit vectors. If this is to vanish for any a, 8, then 


(Az + Vi V2 Ao = 0, 
and this may be written 
(As Vo, 5)” = 0, 
where ¢, is the derivative dyad of y, or ¢, {dt = d; y, the differential of y in 
the ¢ direction. In other terms ¢, = Ao- ()V- y and ¢, = VAo- ¥(). 

If this vanishes for every 6 we must have (A2- Vy)” =0. This can 
vanish for every y only when all the Riemann symbols vanish, which inter- 
prets the meaning of the vanishing of all these symbols. 

16. The Einstein gravitation equations translated into our notation are 
Gis = — — Do Ao pix 

+ (Ao + + Ao: pi; 
= — Ap — Ao pi; — Ao pix 

+ (Ao + + 04; (p*)e 
= — Ag: Ao: Pix (p*); (> Aop* pix); 
= — Ag+ — pix (p"); + pix (p*); + Ao oF 
= — Ag+ + Vois = (Ao + Voi)” = 0 for all 7. 


Therefore the significance is easily seen to be that the convergence of the 
exterior parts of the second differentials is zero. That is, if a is a unit tangent, 
Ag+ Vda" = 0. Hence the sum of the normal curvatures of n orthogonal 
geodesics at the point is zero. The Einstein curvature is 


The Riemann-Christoffel covariant is written also without the accents (since 
components in the space lead to terms that cancel) 


Ag+ Az + Vi V2 Ao: 


1922] GENERAL VECTOR CALCULUS 239 


and if we substitute for y the adjunct of 8, and sum for n linearly independent 
values 8; with 6*, we have the Einstein covariant in the form 


Gas = >» Ag Ay ab; As Vi V2 Ag Bi 6=0 for all a, é. 


This gives Ao Ae a8; As where operates only on the 8*. This takes 
the form 


> (Ao Ao Bi — Ao B' 


which is another form of the potential. 
If we start from the forms (Ao - pir p*)" — (Ao prup*)? and, making 
$ = u, sum, we have 


Gre = Ao + Vert + Ao pre (p*)s = (Ao Ver): = Ao Vor 
as before. 

The simplest way to get the second derivatives is to choose paths in the region 
on the geodesics. There is then no normal in the region. If then the con- 
vergence in the region of these geodesic normals vanishes, the space is of the 
gravitational character. 

If the Riemann-Christoffel covariants vanish for an ordinary space we know 
it is a plane, and a surface nabla of the normals (which would run out into 
three-dimensional space) all vanish. To be an Einstein space of two dimen- 
sions it would be necessary for the convergence in the space of the geodetic 
normals to vanish. For a sphere, for instance, the normals all pass through 
the center, their differentials have any direction on the surface, and would 
not necessarily give a convergence equal to zero. A sphere is therefore not a 
possible Einstein surface. 


TENSORS 

17. As commonly used the term tensor means merely a multiplex of coeffi- 
cients obtained when we substitute in a lineo-linear function of m variables the 
various vectors chosen m at a time from the set pi, ---, pn. Weshall use = to 
indicate a tensor, which from our point of view is an operator. To indicate the 
operands we will insert m parentheses after =. A tensor is an extension of the 
term dyad, triad, etc., used by Gibbs. We will write then a tensor of order 1 
as =(), one of order 2 as Z=()(), ete. The first fundamental form is a tensor 
since it has the form Ao-()(). From it we derive the set of quantities 
Ao: prps. It is clear that all our A symbols, A,,, --- furnish tensors. A 
tensor is a covariant tensor when it reduces to a form independent of p1, «+, pn- 
All the symbols A and their combinations furnish tensors that are covariant. 

A tensor is a derivative tensor, and of order higher by unity than another, 
when it is formed from the other by the process 


240 J. B. SHAW [ October 


For instance from the tensor £ (a covariant, as p1, --+ , p, are not in evidence) 
we derive £49 V() where V operates only on &. If & = VX then we have 
VXA,V(). If isa differential operator, as Ag - () - then we derive 


Ay: Ao: VC). 


If we indicate direction by a subscript as o,, then the derived tensor demands 
that ¢-; = ¢sr. This condition is necessary. It is further a sufficient condi- 
tion that = be a derived or differential tensor. 

All differentials or derivatives taken in this manner without reference to 
the parameters wu are called covariant differentials. The so-called contra- 
variant systems would be found by using the system p’. 

18. To make translations from the ordinary notations we have to notice 
that the covariant tensor A,, means the form Ao - p* () Ao p” (), the 
contravariant tensor A™” means pz()Ao- py(), and the mixed 
tensor means Ao () Ao: py (). 

The so-called product of two tensors of orders a, b is a tensor of order a + b. 
For instance the product A,, Bj means 

D Bi Ao p* () Ao p” () Ao p* () Ao ps ()- 
In expressions where “dummy ’”’ indices appear, that is to say, duplicated 
indices, the expressions are summed as to the operational terms indicated, for 


instance, means Airs Ao - p“ () Ao: () p*() Ao: pe ()- 
We will translate some of the current notations into these forms: 


Quy, w = w] = Ao: Pw; {uv, w} = puvp”, 

Ai = A‘ Ao p*(), (A'); = 2) At Ao: 40°11), 
where the subscript 1 shows the operand of VY. This is covariantive 
differentiation. 

With these indications it is easy to reduce any expression into the vector 
forms, and then to reduce the vector forms to compact expressions which 
from their forms alone show that they are covariantive. 


CURVES 


19. Let a be the unit tangent to a curve (that is, a function of one parameter 
8), which gives 4g - aa = 1. We have therefore at once 


Ay: ada =0. 


Hence de is orthogonal toa. Let 8 be a unit vector so that 


da = ciBds. 


¢; is called the curvature and 8 the principal normal. Usually part of 8 is 
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in the space of the curve, part not in the space. For instance, a curve ona 
sphere may have its principal normal lying in the embedding three-dimensional 
space. We may also write, then, 

da = (mrAX+nv)ds, 


where now g; is the geodesic curvature, n; is the normal curvature, d is a unit 
normal in the space, and y is a unit normal out of the space. 

Since Ay - a8 = 0, differentiating again, Ap - adB = — Ag+ Bda = — c}. 
Hence we have for d8 (the dp is still along the curve) 

/ ” 
dB = —cjat+ciy, 

where ¥ is called the binormal, and cj is the second curvature. Since the 
first term gives the entire projection on a we must have Ao - ay = 0,.and 
since Ag - Bd8 = 0, we must have also Ag - By = 0. Continuing we find the 
trinormal, etc., and the third, etc., curvatures, giving the generalized Frenet 


equations: 
da = c} Bds, 


dB = — ci a)ds, 
dy = — c{B)ds, 
du = «ds. 
When cj = 0 the other curvatures become indefinite but are taken as zero by 
convention. If cj ¥ 0 and cf = 0, from the equations above 
A; dpd* pd*p = 0. 
The condition is necessary and sufficient. Hence, differentiating, 
A; pd‘ p = 0 = Az - dpd? p, 
and all higher curvatures may be taken as zero. 
20. A congruence of curves is defined by the equation 
A, + = 0, 
where X is a given unit vector in the region.* Let us consider n independent 
congruences, one curve of each congruence through any one point, and at the 


point let all the congruences be mutually orthogonal. To indicate the 
orthogonality we shall use a instead of \. Then 


Ao “Aza = bin, 


and using ¢q, to indicate Ap - () V - a;, the operator that gives the differential 


* The notation of Ricci is as follows in vector forms: dijr = Ao + i Upr, XY = Ao + Aap", 
where 7 and h define different congruences; Aijrs = Ao pr(Ai)sy Vigk = Ao Aj 
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of a; in the direction of the unit operand, we have 
Ag Ai Pa, Ao ° Qj Da; Xk for all Aj k. 


Such an expression is called a coefficient of rctation for a reason that will appear. 
There are }n? (n — 1) such expressions. We have also 


Ao + On ba, = 0 if h#l. 


We may expand Ap da, ps in terms of Ap a; pr Ao aj Ps Ao * Ai ba, 
This is the Ricci formula 

but such expansion is not needed in the vector forms. 

21. Let 

= ai ba, () 
be an operator which converts vectors of grade unity into vectors of grade 2. 
Of course it is a very special operator of this kind. The coefficient of rotation 

= Ao Aj a; Ag 

We see that Odp = $>0A2 - a; da; gives what may be called the instantaneous 
rotation for a displacement dp about a plane. That is to say, the plexus of 
congruences rotates about this plane through an infinitesimal amount given 


by the intensity of the 2-vector. (Rotation about a plane or other spread is 
possible in hyperspace.) We can look at the situation in this way: let 


and Ap ax tT = Ao: = Ag+ cda;,/dt,Ag- or =0. The coefficients of 
rotation are the components in the directions of the elementary bivectors 
Ae + a; a; of the bivector 

We can also express 6 a little differently in a form useful in the study of 
n-tuply orthogonal congruences. * 


Let 
the accents showing the operand of V. Then 


The expression A» - Va; is of course the curl of a;. 


22. If f(p) = 0 is a spread of order n — 1, the normals are Vf and form a 
normal congruence. Since fix = fin, we have 


Ao a VAo+ ar — Ao: ax f =0. 


*See Shaw, Triply orthogonal congruences, these Transactions, vol. 21 (1920), 
,P. 391. The present articles contain the results of a paper read December 31, 1915, before 
the Chicago section, On orthogonal congruences in space of three and more dimensions. 
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As ¥ operates upon following a’s we have from this 
Ao Vii Da, Ak — Da a) =0. 
This might be written 
(Yith — Vf = 0. 
Let the unit part of Vf be a,; then the equation gives 
Ao: (Qn Ga, — On Pa, = 0. 
That is, for the entire set of curves in the spread and normal to it, 
Ynkh = (kh, 1,2, ---,n —1). 
The equation also states that @., is self-transverse (symmetric) so far as the 
spread is concerned. The condition is also sufficient. For when it is satisfied 
Ae an Vao = Oforallh,k =1,---,n—1. But 
As Van aj A» An Aj, 
hence we have 
A; * An Van = (), 
whence tA» Van = As fa, for any Choose = — Vt and we have 


Ao V (tan ) = UP tan Vf. 


From this we have also As - V log ta, = — As + Van = Ao: a, 7, Where g is 
in the spread and is indeed @q, Qn. 

23. Let us suppose now that a, is the principal normal of a system of curves 
through the point, that is, a; is the unit tangent, da;/ds for dp along the curve is 
Where? = 1,-+-,n—1. That is, 

ba,% =Cia forall t=1,---,n—1. 

Hence 
Ao On ba,ai = 0, Ag Qj ba, = — An ba, a; = 0 

=1,---,n—1). 
This may also be written to parallel the treatment in three variables referred 
to above: 

A, a, 9a; = 6; On, 
Ao Qn ba, Ai = 


If the curves all lie in a spread as in the preceding article, 


V log t = da, + 
Ao On ba; = Ao * On ba, 


In the general case of all the curves merely such that a, is their common 
principal normal, the sum of the curvatures is 


Ao On >, ba, (t=1,---.n). 
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which reduces to 

= Ao * An Ay a; 9a; or > Ay Aj Pa, Ao Vao- 

This shows on the one hand that the sum of the curvatures of the norma! 
curves is independent of the particular system chosen, if they are mutually 
orthogonal, but on the other hand it gives significance to the geometric con- 
vergence of a. Dividing by n — 1 would give the mean curvature of the 
system. 

The second curvature of the curve with a; as tangent is 

” 


Since a and ¥ are orthogonal, 


” 
Ao a; Da, = Yi; 


and, we may write 


is the intensity of. As a; ba, Qi. 
24. The spread of order n — 1 is called isothermal when 
Ao VVf =0=V'f. 
The resulting equations are simple. 
25. In case the curves in the region through the point in question are 
geodesic we have $a; @ = 0 in the region, that is, for every 7 
Ao + ba, = 
That is, 
Ao On ba, An = O 
for everyi. This may also be written Ag - a, a; 9a, = 0. This form enables 
us to state the condition in the form A, - a, 8a, = 0 in the. region, and this 
vector must lie outside the region therefore:* 
Further applications must be deferred. 
*See page 213. 
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